Fig 1 (cont)

GTAATATCAT TAAATAACAG TTTTCCAGCA CGAGGCATAA AGCTTCTTGT CAGCTTAAAC 6420

ATTGAAAGTT CTTCACTGTC TATTCCTTCT AATACATGAC TTGCAAGTAT GATTTCATCA 6480

AAACCACAGA TATTATGACC TACTACATAT AATTCAATAT ATCTTGGTTC GCACTGTTTT 6540 

AATTTTTTTT CTTTATTTAA GACCATGATG TCTTCATATG ATAAATTTGA TTCAAGACCA 6600

TGATTTTCAC AAAACGTTGA CCAGTATTTT TTAGCTACTG AAATTTGTAG CTCTGTTCTG 6660

AATTTTTTAA AAGCTATGCC AATTTCATCT TCTTTTTTAT TTAACATTAC AAAACATTCT 6720

CTGTTTACCT CATAACCTAT ATCGGTAGCT ATTTTAGAAG CAATTTTTAT GAGTGATTTA 6780

CATCCAATTA ACTTAAAAAC CAACAAGTAA GGAGTTAACT GTTTTCCATA CAAAGAATGG 6840

TAAGTATATG TTTCAATATC ATAAACAATA AAAAGACGTT TTGCTTTTAT GGCTCCAACT 6900

GGATTAAATT TGATTTTTTC CCACCAGAGT TTTGTTTCAT GGTGAATATT GTGATAATAG 6960 

AAGTCCCGTC TTCTGGATGA GCAGTTGTGT ATATTACTAT AAATTGTTCC GCAGAATTCA 7020

CATTTATTCT GTTGTTTAAC AGTTTTTATT AAATATATTT CTCCTTTTAA AATCAATAAT 7080 

TCTATTGGTA ACAAATTTCC ATTAAGAATT TCTTCAGTCA TCTTAAAAAA TCTTTTGTTG 7140

AACTTCCATA TTTTTAAAGA TACGGGGGTG TTAGAATCAC AAAGTTTTAA AACATCTAAA 7200 

ACATTTTCTA CTTTCTTGAA AGAATTTAAT TTTAAACCCT GAATTGCAAA GTAATTATAA 7260 

AAACTTTTTT CAAAATTCTT GTAGTATATA ATTTTTATAT ATGTATCCTC ATATATTCCA 7320

GTAATATAAG TAGTAGTTCT TTGCTTTATT ATTGTCTTTG AAGCCATCTG TTTAAAGCCG 7380

CTTCCCGTAC TCGCTCAAAG CTTCTTAAAA CAACTTCATT TGTACTATAG CCAACAATTC 7440

CAGACAATTT TATTCTAAAT GCTATTTCAA CTGAATCTAA ATCTGAAAAA TCCGTGTTTA 7500

CTTGGTTGAT TACTTCTTCT ATGCTCCCAC TGTCTTCTAC GAAGTCTATA TCTTGAAGTA 7560

ATTGGTCTCT TTCTTCTGGA GTTGAAAAAG AGTAAGATCT TTCATTAGCT TCTATAATTC 7620

CTAAAAAATC ACGAGTTATT CTGCTATATA GTTGTCTGAA TGCTTGTGTT TCTCTATTAA 7680

ACCAAACTCT AGTAAATATA TCTTCTCCAT TTTCATTTCT ACCTCTTAAT ATAATTTGAA 7740

CAAATTGGAT TCCAATATTT CTGGCAGCTA ACCTATTTTG CACTAAATTT AAGTATAAGT 7800

AATATAGCGT GCTTGCCACA TGCTCTAATA TAAAGAAATA CACTAACCAT TTTTGAATAA 7860

AATCATCAGT CAATCTATTT TCATTATAAA ATCTAATAAG TAATTGAAAA AATTCACTTC 7920

CGTAATTAAA AAAATTACTC CTTCTTGCTT CAGGAGTTAA TTCTTCTTCT AAATTTTGAA 7980

TTAAATCTAC TATTGAAGCT ATCACTTCAT CATTAAATTC TTCCCTACTC AGATCGCTTG 8040

AGCTCGGCTC GCGATCTGAA AATCCTTCAT CTTCTATTTC AGGAACAGTA AGAGGAGAAC 8100

TAGAAGTTTC TTCAACATTC CTTACCCTTT GGCGTCTATT AACAGGTAAT CTATCAATAA 8160

ATCTTCTGAT TACATCACCC CTTGAACGTC TCATTATTTC AGTAATAGCT CTATAATTTT 8220

CCCTAGGTCT TAATCTGAAT GGTAATCCTA CTCTTGTCCC TGACCTTAAA GTTAATGCTC 8280

CACCATGCAT CCCACCTTTT CCTAAAGTTA ATACAGTTGC TAAATCTTTT AAATTAATTC 8340

GATTTTCAGC TTCTGGAATT TCCAGCTGTG AAAATTCATC TATAAAAAGC TCAATCCAGA 8400

ATTCAGAAAA AGGTAAGTCT AATATACATT CACTATTATG CATGTTAGAC AAAATTAAAA 8460

ATTTACATAA AGCTTTTTTA ATTTTACAAA TTAACTTTAT AAGGTAAGTA TCCCTTTCTT 8520

GCAAATTTAA AACCATAAAA GCTTGAGAAA AAGGTTGATA ATGCTGCTGA AAAGATCTAT 8580

TCTGATTTTG AGCTGAAATA GCGGAGCCAA AACCTTGCAT GTCTGCAAGT TGCAGACTCC 8640

CTAATATTCT ATCCATTAAA ACCGCGTTTT GAATTTGACT AATTGTTTGT GAAAAATTTT 8700

CTACATTTTG AATTGCTCTC ATATATGACC CAGTATTTAT GGAGTATGAA CAATCAGTTA 8760

AAATTTGCCA GGTCATGCGT CTCTCAAAAC TTATAGGTGA AAGATACAAC TTATATGAAA 8820

TG^TGCTGTA AGTCCGCTGX TCAAACAGA: ACTGGTTTAA AACTCGCGCC ACATAAAAAT 8880

ACCCAATTAA TAAATTTGGT GGAGGTTCTC CTTCAAATGG TGGTTGTGAA GTAACAGGTC 8940

CTCTTGGGCG TAAATCGAGT AATTGAGTCA CTGGATAATT AAAAAATCGA TTAGCCCATT 9000

TTATTCCCCT TTCATGTATA GTCCTTGACC TGGCAATACT TCGATTATTA AGGTCAAGTG 9060

TTAAACGTAA ATATCGTAAG GTATGT-TGAC TTTGCCCAGT GAGTTGTTGC JATTGGTGAA 9120

TCTGCAAGGC AAACAAAAAA TTTATGTTAT TACTGCAGAT GCATCCTAi T TTACAAAA.. 9180

TACGTTCATC ATTGGAAACT CCAGACTTAT CAAGCAACTC CCCGGGCACG xCAAATAAAA 9240

ATGAAAAAGA TGAATTTGAA CCAGCAGTTG GCATTTCTAG CAAACCATCT GATGAA.T.A 9300

ATATGAGACG ATCTCAAAGA GATGATAATT TACCTAAAAG TCAGATACCA JTAGTAGATA 9360

TACTACATGA TAAAAATCCT AAAATGGCAG 'AAGAAC GAGA CTTAATGTAT J^TuTTCT. 9420

CTTGCATAAA ACTTGATGAT TCT.AAAGA..T TAAAAACTGA JATGTTCAGG «^AT.>^ 9480

CTGGAACTAG TCCAGCTCAA AGACACATAG AAGCCGCAGA GCTAAAGAGA AAT^^ATCT. 9540

ATACTCGTAG TTTAGAACAA TGGACACATG ATTCTTTTAT AAGTCATGTT AAACAATTAC 9600
TTTCTAGACC ATTTATATCT CTAGGTATTA CATATTTGGA TGATTTTTTG CAGACTTATT 9660 
TAGATCATAC TGAATCGTCT TCTTTAAACT TTCAACTGTT TACTTTAATA AATCACTGTT 9720
CAGAAAATAC TTTAAAACGG ATTTTAAAAC ACATTTCTAA AAAAAATGAA AAAAATCAAT 9780
ATGTAAATCA ATGGTTGATT GATCTCATTA CATGTATATA TCTAATTATA AGAGATGAAC 9840 
AAAATGTTAC AGAACAAGTT AATGCCCTTT TAGTAACTAG TAATCACTTA GCTTTACATT 9900 
TTGCAAAGAA AGCTACAGGT GGATTCTATC CTACAGCAGA CAAGTTAGCG AAGACTCATA 9960 
TTTTTTTCAA GAGAATAATT TTAGGAATAC TTTCGCTAGC AGAAAGTATA GGTTGCTATA 10020 
CTGTGAATCC ATATTGCAAA AATCCTTTGA AAAAGTCAAA AGTAGAAGTA GAACCAAGTG 10080 
ACGAAATGTA TATGTTCAGC TTAAAAGGTG CACTTGAACA TCCTGATTCC GACGAAGACG 10140 
AAGACAGTGG ACTTCAAAAT GAATAATTAT CATAAATGGA CTTCTAATGT TATAGATGCA 10200 
ATTCTATCAA ACAAAGCTCT TTTAGCTATA AAAATTTTAA AAGTCAACCG TTTGCAAACA 10260
AATTGAATGC TTTAGAATCA GCAGTTGTGC CTCCAAGAAA AGATGATACT CCTGAAATGA 10320 
TAGCAAATCT TTTAAAAGAA TTAGTTGCTT TGGGAGCTAT TCGCAGTGAT GAAGTTGGCC 10380 
CATTATATTC TGACCTTCTT ATCAGAGTTC ACAAATATAA TAGCTTGAAT GTTCAATCAA 10440 
ATTTGCAAAC TTTAACAGGA GACATTAAAT CACTTCAATC CGATATAATT AGAAGTTCCG 10500
ATATTCCCAA TTTAAGTAAT CAAGTTGTTT TAAATACATT TTTAAATTCT TTGCCCTCAA 10560 
CTGTTACATT TGGACAACAT AATTATGAAG CTTTTAAACA AACTCTAAGA TTATTTGTTA 10620 
ATGAGACACC TAATATTACA GTTTTTAGAT CAGGAAATGA TACTTTAATT CAGGTTAACA 10680 
TAACAGGAAT TCATACAATT AATTTGAATG ATGCATTTAA AAATTTAAAA AATTTTTGGG 10740 
GAATAGTATT AACAGGTGAA TTTATTCCAG GTGATATTAC AAGCAGACTA ACAGCTAATA 10800 
CAAGAGTACT GCTTTATTTT CTTGCTCCTT TTACAAATGA TAATACATTC ACACCTGATA 10860 
CTTTTCTAGC TTTACTCATG AAATTATATA GATTGACAGT TTCTTCTGCT TTAGATTTTG 10920 
AAGAAGAAAC TGAAGCTGAA GTAGAAAATG TAGCTCAACA AATAGGATCC ACTAGTGCAG 10980 
ATTTTACAAA GACTTTAGGA TATCTATTAA AAAACAAAGA AGAATCATTT TCGCCTCCCA 11040 
AATCATTATC TCCTAGACAA CTGGGTATTT TAAGGTTCAT ACAGAAAAGT CTGGTAGATA 11100 
AAATTGATAG AAATAATGAA GATCCATGGG ATGCTTTAGA AACTTTATCT TATTCATTTT 11160 
CTGCGTCATT TTATGAGGCC AATGGGCCTT TTATTAGACG GTTAATAACT TATATGGAAT 11220 
TTGCCTTACG TAATTCTCCT ACTTACTTCA GAGAAATTTA CTCCAAGAAA TATTGGATAC 11280 
CACCCAATTC ATTTTGGACT CAAAATTATG CAGACTTTTT TTCGGAAAAG AAAGAAAAAC 11340
AAAATTTCGA AACATTTGAA CCGCGGGAAC TTCCTTTACA AATCTCTGAG GAAGAAGCTG 11400 
TCCCGCATAC AGAAGATTTT CAGTCAGCCA TCTCGCCCTC TATGGGCCAA ACTTCACTCC 11460 
CTGCTCCTTC TGTGTCAGAA TACAGTAGCG TGCCTCGGTC AGCTTTTTAC CCTCTCAGAG 11520 
AACGTATCCA AGAGAGCATT TCAAAGGCAG TCATCCCTCC TTTGACAGGC TATGTCGGAA 11580
AACAAATAGG TGAAACTATT TTCCCTGGTA GTGGAGATCT TGTAGCACCC GCTGCGTCTT 11640 
TAGTTGCAGC ACAATTGGTT GATTCAAGGT TTAATAACAG AAGACAAAGA TTGAAAGACG 11700 
CAGCCAGAAA GCGTCACCGC TATGTTAGAG AGATGCATAA TATTTCTGAT AAAGAGTCAA 11760 
ATGCTTCTAA TGATACGGTA ATATCACCTT TGATTGGACA TGGTTCGCGC ACTGAAAATC 11820 
GTTTTGAATA TTTGAGACCT AAAGGTGGAA ATTATTTATA CTAATAAAAA TCATAACAGA 11880
CCTGACGGGC GGTCATCCTT TTTTATTAGA TGCAGAAATT TGTACCTCCA CCACGAATCC 11940 
TTGCTCCAAC AGAGGGTAGA AACAGTATTA CTTATACGCC TCTGGCACCA CTGCAAGATA 12000 
CAACAAAAGT ATTCTTTATT GACAATAAGT CTTCGGACAT TGAAAGTTTA AACTTTACTA 12060 
ATAATCACAG TAACTTTTTT ACAAATATTA TTCAAAATGC TGATTTGGCA GCGGATGAAG 12120
CAGCAACGCA AGATATTAAA CTGGATGAAA GATCTAGATG GGGCGGTGAA CTGAAAACTT 12180
TTATAAAAAC AAATTGCCCC AATGTTTCAG AATTTTTTAA CAGTAATAGC TTTCTAGCCA 12240
GATTAATGGT AGATAAAACT GATCCAGAAC ATCCTAAATA CGAATGGGTA CAAATTACAA 12300
TTCCTGAAGG CAATTACACT GGAAGCGAAC TTATAGATCA ACTTAACAAT GGTATTTTAA 12360
ACAATTACTT AGAAGTGGGA CGCCAAAAAG GAGTAGAAAT TGAAGACATA GGAGTAAAAT 12420
TTGATACAAG AGATTTTTCA CTTGGATATG ATCCTGAAAC GGGACTAATT ACTCCAGGAA 12480
AATATACATA TAAAGCTTTT CATCCAGATA TTATCTTGCT ACCTGAATGT G^G.AGA. i 12540
TTACATATTC TAGAATTAAT AATATGTTAG GTATAAGAAA GAGATTTCCA TATACTAAAG 12600
GATTTCAAAT TTTATACAGT GATTTGACGA AGGGAAATAT CTCTCCATTA CTGAATTTA^ 12660
ATAACTATCC TCATTCTATC GAACCTGTAA TGCAA3ACGA AAATGGAGTT AbCTAiAA^ 12720

TAGAAAAAAT AAGTGACAAT CCCCCCAGAT GGCAAACAAA GTACAGATCT TGGACTTTAA 12780 
GTTATAAAAA TAATGGAGGA GCTAAAGCCG TAACTGTACT AACTGTTCCG GACATAACAG 12840
GAGGATTAGG TCAAATTTAT TGGTCAATGC CAGATACTTT TAAAGCACCT ATTACTTTTA 12900 
CTAACAATAC TACAAAGCCA GAAACACT?C CAATTGTTGG ATTACATATG TTTCCTTTAA 12960
AAGCAGGGTT AGTTCATAAT ATAAATGCGG TTTATTCTCA ACTTTTGGAA CAAATTACAA 13020
ATACAACTCA AGTATTCAAT AGATTTCCTA AAAATGCTAT ACTAATGCAA CCACCTTACA 13080
GCACCGTAAC ATGGATAAGT GAAAATGTCC CCTTTGTTGC AGATCACGGG ATTCAGCCAT 13140 
TAAAAAACAG CCTTACAGGT GTACAAAGAG TTACTATAAC AGACGACAGA AGGAGATCTT 13200
GTCCATACAT ACAGAAATCT TTGGCGACTG TTGTCCCTAA AGTACTTTCA AGTGCTACAC 13260 
TTCAGTAACA ATCTGGCTGA TATCTCTGGG CCTTATCCTC CTGGAACCGT TATGTCTATT 13320
TTAGTTAGTC CCTCTGATAA TACCGGGTGG GGTATTGGAA CATCAAGTAT GAGGGCTACT 13380 
GGCTTGAAAT TTTCTAAAAA ACAACCTGTT AGAGTGCGAC CTTATTACAG AGCTCAGTGG 13440 
GGACAGCTTA ATGCTCGTAC TTCACTTGAG AAACTAAAAA CCAAATTGAA ATATTATGAA 13500 
AAATTGTACA GGGACAGACT AAAAAGAAAA ACAGTTGTTC CAAAGAAAAA GAGGTCACCT 13560
ACATCTCCTG CGGATCGACT TAAAAAATAT CTTAAAGCTG TCAGTCAAAT CAAAGCTTTC 13620
AATAGAGCTA GAAGAGCAGC CCAATAAATA TTATTTTTCA CTTGCAGATG AAGGTAGTTC 13680
ACGTGCTTAA ATCTCCTCAT CGTCGAAGAC ATACACGTCG TTACAAAAAA CTAAAAAAAA 13740
TCAATCTATC TCCATACATT TTAGCTAAAG AATTGCAAGG CGGTTTTTTA CCAGCTCTCA 13800
TTCCTATCAT AGCAGCCGCA ATTAGCGCAG CCCCTGCTAT AGCTGGAACT GTAATAGCTG 13860
CTAAAAATGC TAATCGTTCT TAAAATTTAG AAAACTTTTT TTTTAACAGA TCACATGGCT 13920
TTTTCAAGAT TAGCTCCCCA TTGCGGCTTA ACACCTGTTT ATGGCCACAC CGTTGGAATC 13980 
TGTGATATGA GAGGAGGTTT CAGCTGGTCT AGTTTGGGAA ATTCTTTTAC TTCTGGTTTA 14040
AGAAACATAG GTTCATTTAT ATCAAATACT GCTCAAAAAA TAGGTCAATC ACAAGGATTT 14100 
CAGCAAGCCA AACAAGGTCT ACTGCAATCA AATGTTTTAG AAAATGCAGG ACAATTAGCA 14160 
GGTCAAACTT TAAATACTTT GGTAGATATT GGAAGATTAA AGGTAGAGAA AGATCTAGAA 14220 
AAATTGAAAC AAAAAGTTAT AGGGAACGAC CAACAAATTA CTCAAGAACA ATTAGCTCAA 14280
CTAATAGCCA GCTTAAAACC AAAAGATGAA ATGTTTGTAA AGCAATCAGA AAAAATTGTT 14340
GAACCTATGA GACCAGAAAT TAAATCTAGC CAAATGCCTG TAGAAATGTC TTTTTATGAT 14400 
TCTGTAAGTG ATGAACCAAT CATAAAAACC AAAGAAGTTA GCCCTCCTTC ATTTTCATCT 14460
GAATCTTCAC ATTCATATTC TCACCCAAGA AAAAGAAAAC GCGTATCCGG TTGGGGTGCA 14520
TTTTTGGATA ACATGACTGG AGATGGAGTA AATTTTAATA CAAGAAGATA TTGTTATTAA 14580 
AAACACTTTT TATTTACAGA TGGAGCCACA GCGTGAATTT TTTCACATTG CGGGTAGAAA 14640 
TGCAAGGGAA TACTTGTCTG AAAATCTGGT ACAATTCATC TCTGCCACTC AAAGTTTTTT 14700 
TAATCTTGGA GAAAAATTTA GAGATCCTTT TGTAGCTCCA TCGACGGGTG TAACTACTGA 14760
CCGTTCTCAG AAACTTCAAC TTCGTATAGT TCCGATTCAA ACTGAGGACA ATGAAAACTT 14820 
TTACAAAACT AGATTTACTT TAAATGTAGG AGATAACAGA GTTGCAGATC TTGGAAGTGC 14880
ATATTTTGAC ATTGAAGGAG TTATTGATAG AGGACCTACT TTTAAACCTT ATGGAGGGAC 14940 
AGCTTATAAT CCATTAGCCC CAAAATCAGC TTTTCCCAAT GCAGCTTTTA TGGATACTGA 15000 
TGAAGCTACA ACAATTTATA TTGCTCAACT CCCTAATGCT TATAATGCTC AAAACAAAGG 15060 
TGTAGAAGAA GCAATTCGAG TAGAAGCAAA CACTACTACT CCTAATCCTC AATCAGGAGA 15120 
ATATGCTACT TATGACTCTG CCAAATTTAA TCCAGAAACT ACTGGTGCTT CTGGAAGGCT 15180
TTTAGGAATT AATAGCTTAG GAGATCTTTT TCCGGCTTAT GGATCTTATT GTAGACCTCA 15240
ATCAGCAGAT GGTAACATTT CAACTGCACC CATAACTAAA GTCTATCTAA ACACTACTGC 15300
TAGAGATGAG AGGGTCAGTG GAGTTACTGC AGTTGACACC GCAACCAGAT TGCATCCAGA 15360
TGCTCATTAT ATTGAATATA CTGATGAAGC CAAAGCTAGA GCTATAGGAA ATCGCCCAAA 15420
TTATATTGGT TTCCGAGACA ATTTTATTGG ACTCATGTTC TACAATAATG GTTCTAATGC 15480
AGGAACATTT TCCAGCCAAA CACAACAACT TAATGTTGTT TTAGACTTGA ATGACAGAAA 15540
CAGTGAACTA AGCTATCAAT ATCTAATAGC AGATCTGACA GATAGGTATA GATATTTTGC 15600
ACTTTGGAAC CAAGCAGTTG ATAGTTACGA CCAGTATGTC AGAATTTTGC ATAATGAAGG 15660
ATATGAAGAA GCCCCTCCGG CCTTATCATT TCCTTCTCAA GGTATCCAAA ATTATTTCAT 15720
GCGTACTGCG GCAGGTAATG CGATGACAGT AGACACGGGT AGAAATACTG CAGCAAAAAC 15780
AGATAACACC AAGGCTTTTA TAGGATATGG CAACATGCCA TCTTTGGAAA TGAATCTGAC 15840
AGCAAATCTA CAACGTACAT TTTTGTGGTC TAATGTAGCA ATGTATCTGC CAGATAGGCT 15900

GAAAACAACA CCACCCAACA TAAATCTACC TGATGACACC AACTCTTACG GATATATAAA 15960
TGGAAGGGTC CCTCTAGCAA ACATAATAGA TACATGGACT AACATTGGGG CTAGGTGGTC 16020 
ATTAGATGTT ATGGATACTG TAAATCCATT TAATCACCAC AGAAATTCAG GACTAAAGTA 16080 
TAGGTCACAA CTGTTAGGAA ATGGAAGATA TTGCAGATTT CACATTCAAG TACCTCAAAA 16140 
ATTTTTTCCT ATAAAAAATC TTTTGTTGCT GCCAGGAACA TATAATTATG AATGGTACTT 16200 
TAGAAAGGAT CCCAACATGG TTTTTCAGTC TACTTTAGGT AACGACCTTA GAGCAGATGG 16260 
CGCAACTATT ACATACACCA ACATAAATTT ATATGTTTCA TTTTTCCCTA TGAATTATGA 16320
AACAGTAAGT GAACTTGAAT TGATGTTGCG TAATGCTACT AATGATCAAA ACTTTGCAGA 16380 
TTATTTGGGT GCGGTAACTA ATCTTTATCA AATCCCAGCT AATACAAATA CTGTAGTAGT 16440 
GAACGTACCA GATAGATCTT GGGGTGCTTT CAGAGGATGG AGTTTCAATA GAATTAAAGC 16500
TTCAGAAACA CCTATGATAG GAGCAACAAA AGATCCAAAT TTTACTTATT CAGGATCTAT 16560 
ACCGCTACTA GATGGTACTT TCTATTTAAC ACACACTTTT CAACGAGTTT CTATTCAGTG 16620 
GGATTCTAGC GTTCCATGGC CAGGAGATGA TAGGCTTTTG ATTCCAAATT GGTTTGAAAT 16680 
TAAGAGAGAT CCTAATATGG ACGCAGAAGG TTATACTATG AGTCAAAGTA CTATCACAAA 16740 
AGATTTTTAT TTGGTACAAA TGGCTGCTAA TTATAATCAA GCTTATCAAG GTTATAAATT 16800 
GCCAGTACAT TCTAAATATT ATGGATTTTT AGAAAATTTT CAACCTATGA GTCGCCAAGT 16860 
ACCAATTTAT GGTAATGGCA CTTATGATTT ATATACTGCT TATATTACAA ACCAAAGAAC 16920 
CATGCAAATT TGGAATAATA GTGGTTTAGA ATCTAAAACT TCAAATCCTC CTATGTTATC 16980
CAACACTGGT CATCTTTATG TAGCTAACTG GCCATACCCT TTGATTGGAC CAAATGCTAT 17040 
TGAAAACCAA CAAACTGAAA GGAAATTTTT GTGTGATAAG TATATGTGGC AGATACCATT 17100 
TTCTAGTAAT TTTTTGAATA TGGGTAATTT AACAGATTTA GGGCAAAGTG TTTTGTACAC 17160 
TAATTCTAGT CATTCACTTA ATATGGTTTT TACTGTGGAT AGTATGCCTG AAACAACTTA 17220 
TCTAATGCTT TTATTTGGTG TTTTCGACCA AGTTGTTATT AATCAACCAA CAAGAAGTGG 17280 
AATAAGTGTA GCTTATTTGC GCCTTCCTTT TTCAGCTGGT AGTGCAGCAA CATGAGCGGC 17340
ACATCCGAAA GTGAGCTGAA AAATCTGATT TCATCATTAC ATTTAAATAA TGGATTTTTG 17400 
GGCATTTTTG ATTGCAGATT -TCCAGGTTTT CTGCAAAAAT CTAAAATTCA AACTGCTATT 17460 
ATTAATACAG GTCCCAGAGA ACAAGGCGGA ATACACTGGA TAACATTAGC ATTAGAACCC 17520 
ATTTCTTATA AGCTATTTAT ATTTGATCCA CTCGGATGGA AAGACACTCA ATTAATTAAA 17580 
TTTTATAATT TTTCACTAAA TTCTCTTATT AAAAGGTCGG CCTTAAATAA CTCAGACAGA 17640
TGTATTAGAG TAGAAAGAAA TACTCAAAGT GTTCAATGTA CCTGTGCGGG ATCGTGCGGC 17700
TTGTTTTGTA TATTTTTCTT ATACTGTTTT CACTTTTATA AACAAAATGT ATTTAAAAGT 17760 
TGGCTTTTTC AAAAATTAAA CGGTTCAACC CCTTCTCTGA TCCCATGTGA ACCACATCTA 17820 
TTACATGAAA ACCAGACATT TCTTTATGAT TTTTTAAATG CAAAAAGTGT TTATTTTCGA 17880 
AAAAATTATA GAACATTTAT TGAAAATACT AAGACTGGAT TAATAAAAAC ACATTAATTG 17940
TATTCTTGCT TTTTGACGTT TTCATTAGTC TTCATCTTCA TCTTCTTCTT CACTGCTAGA 18000 
TTCCAAGATG GTTTTTTTTT TCTTTGATGG AGTAGGCTCT TCAATAGTTC CAAAAGGATT 18060 
CATATCAGAA TCCTCTTCTA TGTTAGGCAA CATAGTATTT TTAACCTGGA ATGACTGATT 18120 
CCACTTAAAT TGAGAAAACT GAATTGGAAT GTTATTTCCC ATACATTCAT TCCAAAATTT 18180 
ACGCACAAGA GTTAAACACT GTAACATATC TGGCAAGCTA ATTTTCATCT CACAAAATTT 18240 
TCCATTATTA CGTCTCAAGT TGTATTGATA GTTACAACAT TGAAACACAA AAACAGCAGG 18300
GAATGTAACT GCTGCGGCCT GAACTCTATT AACATCCTGA ACATCAATTC CTTCCACTCC 18360 
AGATATAGAA AATGGAGTTA TTTTAGGGAG TTGTTTTCCT ATTGTTTGTT TGCCACCATA 18420
ATTACATTCA CACTGACCGA ATATAAAAAG CATATTTCCG ACTTTAGCTT TCGGAAACAC 18480
AGCTTTTGTA GTTTCAATGG CATTTTGCAT AGCCAGCAAG GCCTTCTTTT CATCTGAAAA 18540 
GTTAAGACCA CAACTGCGAG GAGAACATTG CCCAAAACGC TGATGGGCAT CCTCAGCACA 18600 
TAACACGTAA TGTTCCTGAA CTATTTTTAC TACTTGTTTA TTCATACGCC CATTACTAAG 18660
AACACCCCTC CCTTCCTTTA GGGCTTGCAC CCCTGCTTCC GATGTTGGAG GCATTTCAAT 18720 
TTCATTCACC CTTTTAAACA TGAAGTCACC ATGAAAACAT CTAGGACGGT CCTCCTCCCA 18780
ATCATGATAC CACAAATAAC AACCAGAAGC ATTAAAGTTT GGAATGAAGT CAATTTGCTT 18840
ACAAATTGCA CTATATAGCA TTCTACCTCC TACAGTAGCC ATAGATTTAC TGCTACTATA 18900
AGTCAAATTT ATAATTTTCA TCTTTTTCAT GTACTGAGCA AATAATTTTT CACAATCTC. 18960
TTCTTCAGGA TGAAACTTCA TTTGACTGGT ATCAACTTTA ACACACTCTC CAAA uA,, 19020
TAAAATTTCG AGCGCCGCTT GAACTTTATT CTGAAATTCT TCTGTAGTAG A.TTTCTCTT 19080

CTTGATAGAT TTAGTAACTT TTTTAGAAGA CATTATGTTA GTTTTTTTCT CGTTGTAGGA 19140
TGGCTGAAAA AAATATGGGA GAGTCAGAGA AGGGTTTGAA CGAAGAAGAA TTTAACTCTA 19200 
TTCTATCAAA ACATCTGGAA AGACAAATTA AAATCTGTAA AGCGTTAACA TCAAAATTAT 19260 
CGAACTGGAA TATTGGAACA TTGTTAGAAA ACTTGTTAT? TTGTCCTGAT GAAAGACAAT 19320
CATCAGGTGA TCCCGACCCA AAACTAAACT TTTATCCGCC TTTTTTAATT CCGGAATGTC 19380 
TTGCATTGCA CTATCCATTT TTTCTAACAA CTCCTATTCC GCTATCATGC AAAGCGAACA 19440
AAATAGGAAC TAACACTTAC CGAAAATGGA TGAACAATCA AGTCCTGGAT TTACAAATAC 19500 
CTTCCTTGGA AAATTGCAAA TGGGATGATA GCTTGGGAAA TGTAGATTTA ATTGAAGAGC 19560 
TTAAAGAGAA CCAAAAACTT GTTTTAGTAA AACAAGACCA TGAAAGAAAT ATATGGTTTA 19620 
AATCAAAATG CAAACAACTT CAAAGTTTCA GCTATCCCTC ACTCAGTCTG CCCCCAGTTT 19680
TACAACAAGT TTTAATTGAA TCTCTTATCG GCATTAGTCA GGATCCTAAT AACTTTGACA 19740 
AAAATTACGA ACCTGCAATA ACTCTAGAAA AACTACAACA TGTAAACTGT GATCAAGATT 19800
TAAAACAAGT TCAACAAAAA GTATCTTCAG CCGCTACATA CGGAATACTT TTGAAATGCA 19860 
TTCAGACTTT ATTCAGTGAC AAATTATTCA TTCAAAACTG CCAGGAATCA TTACATTACA 19920 
CCTTTAACCA TGGTTATGTA AAATTACTTC AATTTTTGAC AAATGTCAGT TTAAGCGAAT 19980 
TTGTAACTTT CCATGGTTTA ACACACAGGA ACAGACTCAA TAATCCGCAG CAACATACAC 20040
AATTGGCAAC CGAAGACAAA ATAGACTATA TCATAGATAC AGTGTATTTA TTTTTGGTAT 20100 
TTACGTGGCA GACAGCAATG GATATTTGGA ATCAAACATT AGATGATAAA ACAATAAATA 20160 
TAATTAAAGA GGAATTAAAC CAAAATTTTG AGAAAATTGT CAAAGCTGAA TCAGTTGATG 20220 
AAGTTTCTGA AATTTTAAAG TCTATTATTT TCCCTGAACT CATGCTGCGA GCTTTTTGTT 20280 
CTAATTTACC TGATTTTATA AATCAGAGTC AGATATCAAA TTTTAGAAAC TTTATCTGCA 20340 
TTAAATCCGG CATACCGCAG TCAATTTGCC CCCTATTACC TTCAGATCTA ATTCCTTTAA 20400 
CTTTCCTAGA AAGTCATCCA ATACTCTGGA GTCATGTAAT GTTACTAAAT CTTGCTTCAT 20460 
TTCTAGTAAA CCAAGGCAAT TATTTGCATG AACCCGAAAA ACCTTTAAAT ATTTCATCAG 20520 
TTTACTGTAA TTGTAATTTA TGCTCTCCGC AAAGAATGCC ATGTTACAAT AGCAGTTTGA 20580 
TGCAAGAAAT ACTAACCATT GATAAATTCG AGTTCACAAA CTCTGATAAA ACAAAACAGC 20640 
TAAAACTGAC CCTCCAAACT TTTGCTAATG CCTATCTTAA CAAATTTAAC TCAGCAGAAT 20700
TCTACCATGA CCAAGTTTTA TTCTACAAAA ACTGTAAAAG TAAATTTTCT AACCAATTAA 20760
CAGCTTGTGT AATAAAAGAC GAAAAATTAT TGGCTAAAAT AGCAGAAATT CAAATAACGC 20820 
GGGAAAAAGA ACTCTTAAAA AGAGGAAAAG GAATTTATTT GGATCCAGAA ACAGGAGAAA 20880
TCTTAAACAA TGGAGAAGCC ATATCATCCT CTGAAAACTT CCAAAGGCAA AGAACTAGCT 20940 
ATGCTCTACC ATCAAATGAA GGAGAGCGAG CTGGATGGGA AGCCGATGAG CGAAGAAGAC 21000 
GAAGGAGAAG TGAGTGAGGA TGAAACAGAG ACAACAATTC CAAAGAAAAT GAAGTTTACA 21060 
AGTAAGTAAG CTCTAAATTT TTTATATTAA AAACTGAATT TTTTTAGACA AAATTATTTT 21120
AAATTAAATC TTTATAGCTA GCAGTTGATC TTTGTTCGTT TTTCAGAAAA CTCAAGTGTT 21180
CAGTCATATC AAGTTCACTT GCCTCTGAAA CACGAAATTG CGGAAATTCT AGAAAAAATT 21240 
AGACTAGAAT CTAAAAAATA TCCAGGAAAA GTTTATCAAA TAAGAAATAG AACTCCAGCA 21300
AGTATTACAA AACGATACCT GTATGAAAGA GATCTGAAGA AACTGTTCCA GTATCTAGAA 21360
GACGCAAAGA AGCTTTACGC TAAGTACCAA AGCTGAGGCT TTATAGTTTT AAATTTTCCC 21420 
GCCATGGCTC AACCAGTGAC GCCTTACGTC TGGAAATACC AACCAGAAAC AGGATATACT 21480
GCTGGAGCCC ATCAAAATTA TAACACTGTT ATCAACTGGT TGCATGCCAA TCCACAAATG 21540
TTTGCCAGAA TTCAACATAT AAACACCGCA CGCAATGTTA TGGACAAATT CCGCTCTGAT 21600 
TTGACCCGAG ATGACATCGC GGTTAACATC AACAACTGGC CTGCAGAGGA TTTAATGCAA 21660
T^r-T-ACAT TCCTCCGACC TCTAAATCCG C7TCAACCAT AAATGACTG^ 21720 
TTclc TA^cI CTcXaGGAaI KAAC7CAG7 .GGAACTAG7G AAC7AAACGG C7GGGGATCT 21780 
AACCGCCTGA CTTCCTATCC GGATATTCCA CCCATTTTAA AGTATGAAAG GCCTGGTCAA 21840
CAAC77CAAG GCCAAGGAC7 TTT7AAGCAA GAAAATAT7C A.77AT7T7A C-AA.uTCwv, 21900 
CGCC7CCC7C GC7C7GGAGG A7TAAC7CCG CAACAA777G TAAAA-AA71 ^GC-TG.. 21960 

™_, „.,.,., , rr , t- tc aGAVCTATG AG7G7A777C CGAAAGAA77 iAGWu- 22020 

S2&Lu» TC^TCA** CtrWJjm I^TJAJAA «... 
„ AT . T T- iT i- T~ACACTAAA GCATwwww-., TAT . . .Co 1 ~ G^uAiAM^ 

TATATCAAAG ACCCG7AATT CTC7AACT7T AAA7CA7777 TTGAACTAAi ^AA^CA, 22200 
77AAA7GTAG GAAT7AATAT ATCAGAAACC AG7AACAAGC CAGAATTrtAA AiAiACTT^ 22260 
AGGTAAATAT

TTTGCTTATC 

ATAATAAGGA 

CGCTATCAAA 

AAAGCCAGTT 

AGGAAACAAA



CTGAAGAGAG 

AAGGGTTGAT 

TAACTGTAAA 

CTCCAAAAGT 

ATTCTCTAAC 

TAAATGATGA 

GATTGAGTTT 

TAAAGGAACC 

ATATAACTCC 

GTTCTTTAGG 

ACATTGCGAA 

CCTTACGAAT 

CTGACGGCTC 

CCATAAACTA 

AACCATTAAC 

ATAGTGATGG 

TTACCCTCAG 

TTACAGCTCC 

TGGAAGTGGC 

ATGCTTTAAC 

AAACTAATAG

GAACCATAAC 

CTACTACTGT 

TAACTTCTTT 

TTGTTACATC 

CTAAAAACCA 

TTAGTATGTT 

AGTAATCTTT 

AAGCAAGGTT 

GGCGGTAAAT 

AGGTTTTTAC 

CAATTGTAAC 

TTCCAAAGTC 

CAACAATTGG 

ATCAGTGTCT 

AATTGATTTT 

ACATTAATGT 

TGATGAATCC 

AACCTCTGTA 

AACGAGATAT 

AAGTTTCAGA 

AGAAATAGTT 

ATTACAACAA 

ATTAAAGCAA 

ACAATGACAA 

TTTTTGGCTT 

CTTCAAATTA 

TTCAAGTCCA 

CATCACTGCC 

AAGAAATTCC 

GATTCCCTAT 

CGCATATTTT 

TCTTTCTGAG 



CTAAAACATA 

CCCAGTAGTT 

GTGGTGATAA 

TCTACAAAAA 

AAAAXGCATT 

TAACATTTAT 

TACTTAGAAA

GAAGAACATT 

TAGCTGCAAC 

ACCGGAGTTT 

CCAACCCAGT 

CCAATTACCA 

AACTTGATGA 

TAAAGATTTT 

ACAAAAGAAC 

ACCTCTCCCA 

TGTTTTTCCT 

AACCCATTGC 

GCCAAAACAG 

AAAAATTAAT 

ACCATCCATG 

TACAGGCCAA 

ATATTTCTGC 

CAGCAGCAAA 

TACATCACAC 

AATAAATTTC 

AAACTTGGAA 

ATCTTTATTT 

CAAATCCATG 

TAAAAAATAC 

CAAAACAAAC 

ACTGAATGCG 

AAACACAGAA 

TCATGCATTG 

TACTGTATGA 

CTGCTTCACA 

AACGGCATTT 

CAATAAAAAA 

TTTTTTCAGC 

GAGCTTGTTT 

GAACCGCATT 

AAAACCACAT 

TGATACGGAG 

GGTAATAAAG 

CGACGAGAAA 

AGGTGCAATT 

ACAAAACTAG 

GGTAATTCAA 

TGTAAACAGC

CCATCGAGAT

AAGCTCAGAC 

AAAGCCTTAA 

ACCGCCACCC 



TACAGTTTTA 

TAAGTACCGG 

AAAAGAAAGT 

ACAAAAAAAT

AAAAAGTTAC 

GTTAGTAAAT 

CTCCAAATAA 

ATAAGATTAA 

TTACTAGTCT 

ATAAAGTAGT 

CCAATAAAAT 

CTTGACTTTT 

CCAAGAACTG 

TAACTACCCA 

TGGCAAAGAT 

TGTTTTTTCC 

TTTACTCAAC 

GTTCCTCTAA 

CTCAACTTAA 

AACCGTGAAG 

TAATGTTACA 

GCATCAAACT 

CACAAATCTC 

TAAGGATGAG 

CAAAATGAAG 

TGCAGCAGCT 

CTTTTCACCA 

GTCAAATAAC 

ACCTGTAAAA 

AGTCAAATCA 

TTCAGAAAAT 

GCATTTTGCA 

ATGCTTCTGC 

CAAAGAAGGA 

CTCATTTTCA 

AATTAATAAT 

AAGAAAGCAA 

TTTCAACGTT 

TGAAGCTAAA 

TAATTGCTTC 

CAGGCCAATT 

CCTCTAAATC 

AATACAACAA 

TTTTACAGCA 

GCCGAATGTC 

GTATAAAATA 

CAATGTCTTT 

CAAAAAATTC 

GAAAAGAAAA 

ACACAGAAAA 

ATTCTAAGCC 

TGTTTTCTTA 

AGGGGACAAA 



TCGCCATTAA 

GAAGAGACTA 

AATTCAGAAT 

TATAAGGTTC 

CTTGAATATA 

GATAGTCTTT 

ATTCTGTAAC 

AACCACTCTA 

TCTCCAAATG 

AGTCTGATTA 

TCCAACCTTA 

ATTTAACCTC 

ACCTTTCCCA 

AAAAACCCGC 

ATTCCATCAT 

CTTTTGCACA 

TGCCAGAATA 

GAAAAAAATT 

AATCTCTCAT 

TACTAGATCT 

AACACTTTTT 

TTCTTCTGTA 

TATGATCTTT 

TTAAACATGC 

GTACAGACAA 

TACAATCTAT 

ACTCAATCAT 

AAAATATCTC 

GATACAACAG 

CATACTTTTT 

GGATGCATAC 

CCTCCAGAAA 

AAAATCTGTA 

CATTCAACAG 

TGAATTACAG 

GCTAATTTCT 

CAAAATGATT 

AAACAATGCA 

TCACAGCCTA 

AAATCCATCT 

CCAGTTATTG 

AACCCACAAA 

TAAATGGGGA 

CCAATTACAA 

ATTCAGAACA 

AGAATCCTTA 

GCTTGTAAGT 

AG G G G C C T A — 

AATAACACAC

ATTCAGAACA 

AAAAATTTTT

ACTGATTTAA 

TCTTGCCAAG 



ATCTAAAAGC 

TACAATATAC 

TAGGATAAAG 

ATAGAAAACC 

AATTCAGATC 

AAAAATTAGA

CAAGAGAAAA 

AAATCTGAAA 

TTGCAGGCAT 

TATCTGAAAA 

ACTTCTTTCC 

AATTATGTTA 

TGTTTTTCTG 

TCTCATGATT 

ACTTCTCCAA 

AACAGGATAT 

AAAATGAACA 

ATCCCGCCCA 

GAATAAAACC 

AATAATGATA 

TATTGAGTTT 

TTTCTTCCTA 

ACAGTAACAC 

GATACAATGA 

CATCGCATGA 

ATTAGCAAAC 

GTTATTATAA 

AATCCACAGC 

AAAACAGAAA 

CTCACTTACA 

AAGAACATTC 

AATGCAGTCC 

AAGAAACTAA 

CAAAAGGATC 

TCTGTAACTT 

TTAAGCAGCT 

TCTTAAAATA 

AAAATGCATT 

TTTTATTACA 

TCTTACAATT 

TTTAAATTTG 

GATCTATGAT 

TTAACAAGGG 

GCAACAGGTA

GCATTGCATT 

ATGACAGTGA 

TTCAAAAATA 

AAAATTAATC 

CCAAAAATAA

CTCAGTGTCA 

TGAGAA\- . vjv- 

CTGCCCTAAC 

AACTACAAGT



CATCCTAAAT 

TTGATATTGA 

CATTCTCCCA 

TACTATTTAA 

ACCTAAAAAA

AAAGAATCAA 

ATTGTAACCT 

AGCATTATGA 

TTCAAAAAAT 

AGTTTAACTT 

TGCTAAAACT 

CATGTTATTC 

AAATAACAAA 

TTTTCTTATA 

CTGTCAAAAC 

AAAAAATATT 

GCTTAACCTT 

ATATGCTAAA 

CAGAGAAAAT 

TTTTGAACTC 

TTTCTTACAA 

GACCACAAAA 

TTACATTTAA 

CCAGAAGGAA 

AATCTTAAAT

CGTTTTATAT 

CACATTACAA 

TCATCTGGCA 

ATTAATGCCA 

AAACTTTGTG 

TCCTCTCAAA 

ATTGAGAGGC 

CAACTTCCAA 

GTGATGAGCC 

ACTATAATGC 

CAAAGAAAAC 

CATTTTTCCA 

TTTATGCACA 

TGATTTTGTA 

TTTTCTTTTT 

CTACAGAAAC

CCACACAAAA 

ACGCAACACA 

ATGGAGTATA 

TTATCTTCTC 

TGAATTGAGG 

TCTTCATCCA 

AGACTAATTT 

AAAACTCTTA 

TGTTTCTTAA 

AAAAACCCAG

AGGAACTCCA

CCATAAAACA 



GGACCTCTAG 

TGTCTGTTAA 

TGTTGATTCA 

CAAATCTATA 

CGAAAAAAAA 

GTCGCTTTTA 

AAAAGGTAAA 

AAAATTCTGA 

CAAGAGGAAA 

CCTTTTCAAC 

CCATAAAAGT 

TACCCATAAA

AATGTTGATT 

TAAACAGGAT

ATACCACTTA 

TTTGCCACAA 

TTTCCCTCTT 

GGCTTCTCCC 

TTCCAGTAAT 

ATAAAAATCC 

CTGCATTACA 

TTACAGACTT 

ATGGGGAATA 

GATAATACAA 

GTGATTTTAC 

ACAAACATAA 

ATTTTGCTAT 

GCAAACTTCG 

TTCAATAACA 

AGCAGGCCTC 

AATTGCTTTA 

TCTTCTCTTA 

ATTCCAATCA 

AATAAAGCTT 

ATTTTAAGCT 

TCATCAGGAC 

GCATGATGAA 

GTGAAAGTAA 

TGCTCCAAAA 

ATAAACACCA 

TGCAGACCAC 

ACACAAAGAA 

ATGACCCGAA 

TTTCCCAATG 

AAACCTCTTA 

AAAAGCAAAA 

AATCTCAGTC 

AATATCATCT 

CCCCTGTTAT 

ATTGTTCCCA 

TTTTTATAAC 

CATTCCGGCC 

ACATCCTGCA 

AATTATACCA AAGGTTTCTA TGGTCGACAC AATTACAACC TGACCTAAAA GGTGAATAAA 28680 

GCAGTAAATA AGGATGAGTT AAACAGGCCA CACAATGTCC AGAATGTAAA AAATGCTTTG 28740 

TTTGGCACCA ACCAGACCAC AGCTGAAGCA AAGGAAAATT GTAGCGAACA CATTCTTCTC 28800 

GTAATCTGTT TAACACAGAA CAACATTCAA TTCTGGCAAA CCTCTTTAAA AAATGTTTTC 28860 

TGAAATATTT CTTTAAAATG ACAGTTTGCA ACTCTGGAAA ACACAAAATA AAAGCCGCAA 28920

TATCTCTACT GCTTAAATAT AAAAATATCA TTGTCCAAAT TTCTACTGGT AAAACTGAAA 28980 

GCATCTTCTT CCTATTAAAA AAAGAAAAGT GTTTTCAAAT TATATTAGAC TCTAACCAAA 29040 

AAAATTCAAA TACTTTTCCT TTATAATGTA CATTAAGAAT AAAAATATAC TCACCGTTTA 29100 

AAAGTAGAAC TTAACAGTAT AATATAAATA CAAGTGAGCT GAACAACGAC AGCCGATTTC 29160 

AGCCGGAGCA AAATTAAAAA GAATAAAAGG ATCAAACCAA CACGTAGGAC AGTCTACTCC 29220 

AAAACAGTAA CGGCAGTATG ACACAGAAGG AGAGGAACTA AGTCCAGGAA ACTTCGCCCG 29280 

GTGCGATAAA AAGTAACGCC GCCGGAAAGC AGTTGAATAC AAAAGAGGTA AAAATTCACG 29340

AAAAACAGAA GCAAAAACTA CTAAATCTGC TATTGGCAAA TAAAGAAAAA TTTCAAACCA 29400 

TATTTCCAAA GGAAGAAAAG CAATCATACC GTAGAAGAAC CTGAAGGCGA CCGCAAACGT 29460 

GCTCCCGTAC CACAACGTCA CACGCCACAC CCACTGGGAA AACCCACACG CCCCGCCTCT 29520 

GTGCAACGTT ATATATATGA ATAG 29544
Fig. 9
Figure 1 J NUCLEOTIDE SEQUENCE OF PLASMID pQAV100

KpnI site (with 3' terminal sequence)



CTATTCATATATATAACGTTGCACAGAGGCGGGGCGTGTGGGTTTTTTATTGTTTATTCT
CATGGAATTTACAAAGAAGTAAGTTGTTGGATCTTTATTCAGAATTCTTTTAACAATGAC - 
TTTTTTACTTATTACATTTTTCATCTTTTTTACTTCACATGATATTTTACTTAAATTTTG'- 
TACATACAAGCGAAAATTCGCATAAAATGTGTTACTTTAAAAAGTTAAATTTTTTTTTTA- 

acgcataaatggacgtacagcagcaattggaatagcaggaagggccattgtaaagtgtgt- 
tcctgctgatgccgctgcagaaaggatagatgctatcgtacgcataaacccccc'xcctat- 
ttgttcatctgctgctrttaxtatatcttctgccaatctaggtgatatttgct'tttgaat^ 
gctgtttccaaaagcttgcatcatcggattttcaattaaatggattggatttgcagaatt-- 

tcctAaaaaatagcccaacccatctaaagcagttaaaagtattctccctccaggaaccac 
agatataattaagcggagcaaccgagaggttaaattccagggtcctccgaagagagtatc
taggatcaggccaagaagtgaaccaaaaagacttgtaagtagaagttgtctgatatgctt 

tg<3agaggactgtaaaaattgcaaaacggtatctaatgaccatttcttctttacttttac 
atctgtatcatgttctccatcagaaggtcttattgggaagtaccattggtcacgagcatc 
tttgaagacttctgtttcttgaaattctgttttcggtaagcgactagcagttatggtatt 
aggaatattgacggtaatgttatto^catctacaatttctggaggaatccatcttgcata 
<3<^tgaaatgggttttgtgggttctttcaatatataattgcgaggagggtttttccaaaa 
tctctgaacataagtattttctgattttggcggttttttgctttttcgcgctctttttct 

TGGCTTTGGTCTTTGAAATTTTTTCTTCCTTTTTCTGTAGGCTCCTCCTGCTAAAGCTGT 

gttatttgtgacgtacatcctgttagctacacgattttcccggactgcaaatttttttgX 

CAAATGGAAAAGAAATTGCTGAAACCTTCTATTAATCATATAAATTGTCAGTGGAATCAT
GAATCAGATAGTGCAGGATTTTTTCTTTTTGATACTGATAATTTATACTATTATGTATTG 
GATCAAGTGTCTTGGATATGTTTAAGAGATATAACTCTTGATTGTGATCGCATGTGGTTA 
GCGGTTTGTTTTTGTTTGTGCAAATCTAAATTTGATGTACACAATATTCTAGCGGGAGTA

CATGTTATgtaatgaaaatgacgtcggggattgaatggattgagccttatTTGACATTTT

TCTGTGATTTTTTTGCCTTATTAGGAAAXAAATTTGTGGCGCCAGTACGATGGAGATTGG 
AATGACTCCTGCATTTACAGAAAGGAATTTGTACTGTGTTTTGCTTGACTTTAATTTAAG 
ATGGTATCAGCAGATATTTAACCCAATATGGATTAAGCCAAATTTATGGGCTTTCTCTGA
TTTTTTAAAAAAAATGGCCTTTATTTATGCTAGCGACTTGGCGTTGTTAAATTCTTACAT 
CCCTGGTAATGTTTGTAACAAACTTGATATCATCAAGAAAGATCTTCCTGAAGATTTTAC 
CGTGTCTATGTTTTGTGTCTTAGTGTGTTGGCTTGCTTCTTTCTGTAAAGGTTCTAATTT 
AGCTGAAACTCGCCAGAATTGTCACGCGGTAAGCAAATTTCTGGCACAACTATCAAAATT 
AATAAAACCCTAATTTTTAGTTTGTAAAAATAGAATTCAAATTTTTAACGCCACAATGAC
TTCGGCGGAGTTTTCTGTTGAATTTCCTTATGTTTCTAAGCCAATTGTTCCATGGCCTGC 
TTCGSCATCTTCTAATAATTCATCGAGTCAGAATATTGACTTTCCTGTTCTTAAACCAGA
TCAAGATCCAATAGCCTTCTTTCAAACTAAGAATACGGCTTACTTACAACCTGGAGCTAC 
TTATTACTGGAAGTGTATCGAACTGTCAAAGCCTATTCACATTTACGGTCAAGGAGCTAC 
AGTACAACTTGTCGGACCTGGACCTGTGTTTGTTTTCAACAGTGAAASTGTTATTCCTGA 
AGATTTTTACGTCGTGTTTGAAAATATCAACTTTATTGAAGATGAATTTCCTATTAGAAG 
TGGCCAGTTAAGTTTAGGACTTACAACTCACAGTGCTGTATGGTTTATCAATGTATGGAA
AACTTCAATAGTCAATTGTAACTTTAAAAATTTTAGGGGAGCGGCTCTTTGGTATTCAGA 
TAATAGAAATTTTTGGAATGCGAGAAAATGGAATCAGCAGCATTTAGTTTCAAATTGTCG
TTTTAATGGTTGTAGAATTGGAATTTCTAATACTGGTTCATCTGAATATTCCATAGCCAG 
TCAAAATCAATTTTATGATTGTCAAATCTGTTTTAATGTAACCGGGGGTAATTGGTCTAG 
AAATAATAATGTTATTGTTAACTGTAGATGTGCTTATCTGCATGTTGGAGATAACATGTG 
GTATGAAGGCCATTCCGAAAATAATAATCCCGCTAAGGGTACTTTCTGCAATAACATAAT 
TAACCATGCTGATAACGGAGGCAATGTCTGGCCTACTCAGTTTAAACTTACAGATGGATC
AACGATACAGTTAGCATCATTTTATTTTGATGATAATCAAGAAATTCCACCTTGTTATAG
CGGTAATTTTCATTGGTTTGGAGATGTAAACATTGTAAATTTTTCTACCACAAAAATTGA
TAAATGGTGCATTACTGGATGTAATTTCTATGGTAATACACATGCAGCTAACGATGCTGG
TCAAGTTCAGGTTGCTGAAGCTGTAAAAGACAAAGTGTTTATTATTGGGTGTTCTGGTAA 
TAATGTAACCATGAAAAATATTGTAGAAGGTAACATGACTCCAAAAATTGGTACAATAAA 
GTAAAAAACTTTTTATTCAAAACAAAATGGATTTACATTTAAACGTTTTACATATTGATT 
CTGCGTATAAGTTCTTTTTCTAAACACTCTTCTAATTTCCATACATGCTTGATAAAACAA
ACTTTGTAAATTCATAAATATAGGTTTGACTTGATCAGAAGGTGAATAATAGCTCCATCT 
AAATGATTCGGTAATAGGAACATTATTATATATTAACCAGCTATATTTTGAGTTAACTCT
TGCATGATCCACTATATCTTTAAGTACAGGGATAAGTGCACTCGGAAATCCAAAAGAATA
GTTTTTAATAAATCTATTTATCTGTGAAGAATCAAGCTGCGGACTAATAACATGACATTT
TGATTGAATTTTTAAATCCTTAATATTTCCTCTATCATGACGCGGGTTCATATTATGTAA

AACTACTACAACAGTGTAACCATTACATTTGGCAAATCTATTAAAAATTTTTGACGGTAA

AGCATGAAAGAAAGAACTTATAGAATGACATGATCCCAATTGATTCATACATTCATCTAT

TATAATACAGATAGATCCTTCACTTGCAGCTCTGCAGAATATATTATCTGGATTATCAAT

ATTTAGATTAGTATCGGAAATAGCATCTTTGAAAGCTAATTGTATAAATTTTGGATTTAA

TGTTTTTGTTAGTC5GATTAGAGAATGCATCGTAGTTTCCTTCAACACACTGTGCTTTCCA 

CGCAATTTTTTCTTCTAATGGAACAGTACCTTTTTCTGGAGTTATGAAAAAAATTGTTTC 

TGGTATTGGATCAATTAGTTTTCCAGATATAATATTTCTTATAAATTGAGATTTTCCGCT

ACCTGTGGGTCCATATACAGTAACAATGAATGGTTGTAATCCGCAGTTTAAACTGGGTAT 

ACAGCCATCTTTTAACAGATTGTGAGCCTCATTTACAGTTTTTTGATAATTTACAGCAAT

ATTGTGTAAATCAGXCATAAGTTGACCATGATACATACATTTATOVAAAACTTCTTGACT 

TTCTGGAAATGGArTTCTGCAAATAGAAGGATCTATCTTTACAACATCATTTTTCCAATT 

TAATGTGTCACXTAAAAATTTTCCCAAAAAGCATTTTCTGTCAATGGTTCTTGCGGTCTT 

GGATTTGGGTGTCTCTTGTCGTACGGGTAAAGTAAGTATCCTTTCTTCCACTGGATCCCT 

TTCCTCATCGTTTGATCCTTCCAAGGTCTCAGAATTCTGGTTAJGTTGCtTCTCTACCACC 

GTGAATGGTACATCGGTTCCACTTGCGGTTTQCACSTGXCTTTTTTAAACTTTTCCrCGAT 

GTCTGAAACTCTTTCTGTGGTTGTTCTAATAAATTATAGTCAGTAAAACAATGTTTTAGA 

ATTTCATAGTTTAAACAATTTTTAGCATGACCTTTGGCTCTTAATTTTCCTTCTCCAATA 

AATrrACAGTTTTTACAAGTTATGTCTTTTAAAGCAXATAATTTAGGAGCTAAAATACAT 

GTTTCTGAACTGAATGCTTCAGCTCCGCAACGGTTACAAACAGTTTCGCATTCAACCAAC 

CAAGTTAGACATGGATGTTTTTCATCAAAGATTAAATTTGAGTTATATTTTTTAAGTCTA 

TGTAATCCTTTTGAT7VACATGAGTTGGTGGCCCTTTTCTGTTAAGAATAACGAGTCTGTA 

TCACCATA^TACTTTTTATCTCCCTTTCTATGTAAGGTTTACCCATATCTTCCCCATAT 

AAAATTTCTGCCCACTCACTCATGAAAGCTCTGGTCCAAGCCAGCACAAAGGATGCTATC

TGAGTTGGATATCGGTTGTTCTTGATCCATTCTTCCTTATCCTCAATAGTTGTTAAAATT

AAATCATTACAATCAGCAGATAAAAAAGTTATAGGCTTAAAAGTCACGTGATCTTGATTT

CCTATAAAAAGTGGAAAATTAAAATTTTCATTTGTGTCTTTGGAATCTTTGGtXGGCATT 

TCAGGTAGGTTT GAAAAATACTGATT CCACTCAAATGAACGTTTTGGTAATGATTTACTA 

ATCACAGTTGTGTATGATGTAATTTCAGCTGATCCATTTTCTAATCTTTTTTTATCTTTC 

TCTTCAATATTTTCAGCAAACACTACTTTCTTTTTATCTATACGGGTAGCAAACGAACCA 

TATAAAGCATTT GATAACAATTTACTTATACTTCGCTGAATCTTGTT GTTACTTTTACTT 

GCTTTTTCTTTAGCCATAAXATTTACTTTCACATATTTTTGACATAACGGTTtCCAGTCA 

CT CCATACAGCATACATTTCAGAGCTTTXGATTAXTTTGCATTT CCATCCTCTATTGTGT 

AAGGTGATTAAATCGATAGAGGTCAGTACTTCATTTATCAATGTTTCATTTGACCAGCAT 

AACTTTCCACTTTTTTTAGAACATAATGGAGGTAACACATCAAGATAATCTAATGATGGG 

GGTTCACAATCGGCTACCACAATCATAGGTTTGATTGAATTGTCAAAATAATCTATTTTT 

TCTTTTCTTTGTAGrAGTTCTTGAAAGTAATCTATTTGTGCATTGGCTTCAAAAGCATTT 

AAAGTTTTT C CATAT GGAAGTGGAT GC GTTAAGGCAC TAG CAT ACATT CCGCAGATAI CA 

TACAC^TATATTGCTTCTTCAAATATTCCTAAAAATGAAiGGAXAACATCTTCCT 

AAACTCATTCTAACAAAATCATACATTTTTTCTGATGGAGCTTCCAAATTTCTTAGGAAT 

T CAGAGGGATGATCTT CT T CATTAT AAAAGATTTGTTTAAAGAATGCT T GAGTATTACTA 

CTAATTGTAGGACGTTGGAATATATTAAAA<^CACTCAAGCTTTAAAGATGTTGTACAG 

AACT CTT GAT AACCTTCT ATAAGTTTTT CAACTAATTGAGCC GTAACTATAACAT C ATCA 

ATACAATACTCCTTAGCTTCCTCTAATAAGTTGTATTTTTGGTTGTGTTTTGGTTTGTTT 

TGTAAATATTCTTCAAATGAATTCCAATATTTTTGAACTGGATAACCATTGTTTTCTTTT 

TCATATTCTCCCAACATAAAAAAATCATTGATTGCCCTGTAAGGACAATAACCTTTGCTA 

ACACTCAACTGATATGCAGTAGCAGCGTCT CTTAAAGAAGAGT GGGTTAACAAAAATGTA 

TCCCTAACCATAAATTTTATACCTTGCCATTTCATATCTTCAAAATTAATAATTCCATTT 

TTCCATCTTTCATAAGTTGTATGTGAAGGTTTCTTAAAGCAAGGATTTGGAAGAGATAAT 

GTAATATCATTAAATAACAGTTT T CCAGCAC GAGGCATAAAGCT T CTTGTCAGCT TAAAC 

ATTGAAAGTTCTTCACTGTCTATTCCTTCTAATACATGACTTGCAAGTATGATTTCATCA 

AAACCACAGATATTATGACCTACTACATATAATTCAATATATCTTGGTTCGCACTGTTTT 

^ T ^ t TTTTTCTT T ATTTAAGACCAT GATGT CTT CATAT GATAAATT TGATT C AAGAC CA 

TGATTTTCACAAAACGTTGACCAGTATTTTTTAGCTACTGW^TTTGTAGCTCTGTTCTG 

AATTTTTTAAAAGCTAT GC CAATTT CATCT T CTTTTTTATTTAACATTACAAAACATT CT 

CTGTTTACCTCATAACCTATATCGGTAGCTATTTTAGAAGCAATTTTTATGAGTGATTTA 

CAT CCAATT AACTTAAAAACCAACAAGTAAG GAGTTAACT GTTTTCCATACAAAGAATGG 

TAAGTATATGTTTCAATATCATAAACAATAAAAAGACGTTTTGCTTTTATGGCTCCAACT 

GGATTAAATTTGATTTTTTCCCAC«GAGTTTTGTTTCATGGTGAATATTGTGATAATAG 

AAGTCCCGTCTTCTGGATGAGCAGTTGTGTATATTACTATAAATTGTTCCGCAGAATTCA 

CATTTATTCTGTTGTTTAACACTTTTTATTAAATATATTTCT CCTTTTAAAAT CAATAAT 

TCTATTGGTAACAAATTTCCATTAAGAATTTCTTCAGTCATCTTAAAAAATCTTTTGTTG 

AACTTCCATATTTTTAAAGATACGGGGGTGTTAGAATCACAAAGTTTTAAAACATCTAAA 
ACATTTTCTACTTTCTTGAAAGAATTTAATTT TAAACCCTGAATT GCAAAGTAATTATAA 
AAAC TTTTTTCAAAAT TCTT GTAGTAT ATAATTTTTATAT ATGTATCCTCATATATT CCA 
GTAATATAAGTAGTAGTT CT TTGCTTTATTAT TGTCTTTGAAG C CAT CTGTTTAAAGCCG 
CTTC CCGTACT C GCT C AAAGCTT CT TAAAACAACTT CAT TT GT ACTATAGCCAACAATTC 
CAGACAATTTTATTCTAAATGCTATTTCAACTGAATCTAAATCTGAAAAATCCGTGTTTA 
CTTGGTTGATTACTTCTTCTATGCTCCCACTGTCTTCTACGAAGTCTATATCTTGAAGTA 
ATTGGTCTCTTTCTTCTGGAGTTGAAAAAC3AGTAAGATCTTTCATTAGCTTCTATAATTC 
CTAAAAAAT CACGAGTTATT CT GCTATATAGTT GTCT GAAT GCTT GTGTTTCTCTATTAA 
ACCAAACTCTAGTAAATATATCTTCTCCATTTTCATTTCTACCTCTTAATATAATTTGAA 
CAAATTGGATTCCAATArTTCTGGCAGCTAACCTATTTTGCACTAAATTTAAGTATAACT 
AATAT AGCGT GC TT G CCACAT GCT CT AATAT AAAGAAATACACTAAC CATTTTT G AAT AA 
AAT CAT CAGT CAAT C TAT T T T CATTATAAAAT CTAATAAGTAAT T GAAAAAAT T CACTT C 
CGTAATTAAAAAAATTACTCCTTCTTGCTTCAGGAGTTAATTCTTCTTCTAAATrTTGAA 
TTAAATCTACTATTGAAGCTATCACTTCATCAXTAAATTCTTCCCTACTCAGATCGCTTG 
AGCTCGGCTCGCGATCTGAAAPlTCCTTCATCTTCTATTTCAGGAACAGTAAGAGGAGAAC 
TAGAAGTTTCTTCAACATTCCTTACCCTTTGGCGTCTATTAACAGGTAATCTATCAATAA 
ATCT^CTGATTACATCACCCCTTGAACGTCTCATTATTTCAGTAATAGCTCTATAACTTT 
CCCTAGGTCTTAATCTGAArGGTAATCCTACTCTTGTCCCTOACCTTAAAGTTAATGCTC 
CACCATGCATCCCACCTTTTCCTAAAGTTAAXACAGTTGCTAAATCTTTTAAATTAATTC 
GATTTTCAGCTT CTGGAATTT C CAGCT GT GAAAAT T CAT CTATAAAAAGCT CAAT C CAGA 
ATTCAGAAAAAGGTAAGTCTAATATACATTCACTAITATGCAT 

ATTTACATAAAGCTTTTTTAATTTTACAAATTAACTTTATAAGGTAAGTATCCCTTTCTT 
GCAAATTTAAAACCATAAAAGCTTGAGAAAAAGGTTGATAATGCTGCTGAAAAGATCTAT 
TCTGATTTTGAGCTGAAATAGCGGAGCCAAAACCTTGCATGTCTGCAAGTrGCAGACTCC 
CTAATATTCTATCCATTAAAACCGCGTTTTGAATTTGACTAATTGTTTGTGAMAATTTT 
CTAGATTTT GAATTGCTCT CATATATGACCCAGTAXTTAT GGAGTATGAACAAT CAGTTA 
AAATTTGCC AGGT CATGCGTCT CTCAAAACTTATAGGTGAAAGATACAACTTATATGAAA 
TGTTGCTGTAAGTCCGCTGATCAAACAGATACTGGTTTAAAACTCGCGCCACATAAAAAT 
ACCCAATTAATAAATTTGGT GGAGGTTCTCCTT CAAAXGGTGGTT GTGAAGTAACAGGTC 
CTCTTGG GC GT AAAT C GAGTAATT GAGTCACTGGATAAT TAAAAAATCGATTAGCCCATT 
TTATTCCCCTT TCATGTATAGTCCTTGACCT GGCAATACTTCGATTATTAAGGU CAAGT G 
TTAAACGT AAATAT C GTAAG GTAT GTT GACTTTGC CCAGT GAGTT GTT GCCATTGGT GAA 
TCTGCAAGGCAAAC^AAAAATTTATCTTATTACTGCAGATGCATCCTAITTTACAAAATT 
TACGTTCATCATTGGAAACTCCAGACTTATCAAGCAACTCCCCGGGCACGTCAAATAAAA 
ATGAAAAAGATGAATTTGAACCAGGAGTT GGCATTTCTAGGAAAC CATCTGATGAATTTA 
ATATGAGACGATCTCAAAGAGATGATAATTTACCTAAAAGTCAGATACCAGTAGTAGATA 
TACTAGATGAT AAAAATCCT AAAAT GG CAGAAGAAC GAGACTTAATGT ATAAATCTTCTG 
CTTGCATAAAACTTCATGATTCTAAACAATTAAAAACTGATATGTTCAGGCCGGATTTTG 
CT G GAACT AGT CCAGCT C^AAGACACAT AGAAGC CGCAGAGCT AAAGAGAAAT GGAT CTT 
AT ACT C GTAGT TT AGAACAAT C GAC ACAT GAT TCTTTT ATAAGT CAT GTTAAACAAT T AC 
TTT CTACACCATTTATATCTCTAGGTATTACATATTT GGATGATTTTTT GCAGACTTATT 
TA^TC^TACTGAATCGTCTTCTTTAAACTTTCAACTGTTTACTTTAATAAATCACTGTT 
CAGAAAATACTT TAAAACGGATTTTAAAACACATTT CT AAAAAAAAT GAAAAAAAT CAAT 
ATGTAAAT CAATGGTTG ATTGATCTCATTACATGTATATATC T AATTATAAGAGAT GAAC 
AAAAT GTT ACAGAACAAGTTAAT GCCCTTTTAGTAACTAGTAAT CACTTAGCTTTACATT 
TTGCAAAGAAASCTA»GGTGGATTCTATCCrACAGCAGACAAOTTAG 
TTTTTTTCAAGAGAATAATTTTAGGAATACTTTCGCTAGCAGAAAGTATAGGTTGCTATA 
CTCTGAATCCATATTGCAAAAATCCTTTGAAAAAGTCAAAAGTAGAAGTAGAACCAAGTG 
ACGAAATGTATATGTTCAGCTT^AAAGGTGCACTTGAACATCCTGATTCCGAC'SAAGACG 
AAGACAGTGGACTT CAAAATGAATAAT TAT CAT AAAT GGACTTCTAATGTTATAGATGCA 
ATTCTATCAAACAAAGCTCTTTTAGCTATAAAAATTTTAAAAGTCAACCGTTTGCAAACA 
AATT GAAT GCTTT AGAATCAGCAGT TGT GC CTCCAAGAAAAGAT GATACT CCT GAAAT GA 
T AGCAAAT CTTTT AAAAGAATTAGT T GCT T TGGGAGCTATT CGCAGT GATGAAGTT GGCC 
CAT TAT ATT CT GACCTT CTTAT CAGAGTTCACAAAT ATAATAGCTT GAATGTTCAAT CAA 
ATTT GCAAACTTTAACAGGAGAC AT TAAAT CACTT CAATC C GATAT AATTAGAAGTT CCG 
ATATTCCCAATTTAAGTAATCAAGTTGTTTTAAATACATTTTTAAATTCTTTGCCCTCAA 
CTGTTACATTT GGACAACATAATTATGAAGCTTTTAAACAAACTCTAAGATTATTTGTTA 
ATGAGACACCTAATATTACAGTTTTTAGATCAGGAAATGATACTTTAATTCAGGTTAACA 
TAACAGGAATT CATACAATTAATTTGAATGATSCAT TTAAAAATTTAAAAAATTTTTGGG 
GAAT AGTAT T AACAGGT GAAT T T ATT C CAGGT GAT ATT ACAAG CAGACTAACAG CT AATA 
CAAGAGTACTGCTTTATTTTCTTGCTCCTTTTACAAATGATAATACATTCACP^CTGATA 
CTTTTCTAGCTTTACTCATGAAATTATATAGATTOACAGTTTCTTCTGCTrTAGATTTTG 
AAGAAGAAACTGAAGCTGAAGTAGAAAATGTAGCTGAACAAATAGGATCC 
ACTAGTGCAGATTTTACAAAGACTTTAGGATATCTATTAAAAAACAAAGAAGAATCATTT 
TCCCCTCCCAAATCATTATCTCCTAGACAACTG<?GTATTTTAAGGTTCATACAGAAAAGT 
C T G G T AGAT AAAAT T GAT AGAAAT AAT GAAGAT C CAT GG GAT G CT T TA GAAAC T TT AT CT 
TATTCATTTTCTCCGTCATTTTATGAGGCCAATGGGCCTTTTATTAGACGGTTAATAACT 
TATATGGAATTTGCCTTACGTAATTCTCCTACTTACTTCAGAGAAATTTACTCCAACAAA 
TATTGGATACCACCCAATTCATTTTGGACTCAAAATTATGCAGACTTTTTTTCG , SAAAAG 
AAAGAAAAACAAAATTTCGAAACATTTGAACCGCGGGAACTTCCTTTACAAATCTCTGAG 
GAAGAAGCTGTCCCGCATACAGAAGATTTTCAGTCAGCCATCT^ 

ACTTCACTCCCTGCTCCTTCTGTGTCAGAATACAGTAGCGTGCCTCGGTGAGCTTTTTAC 
CCTCTCAGAGAACGTATCCAACAGAGCATTTCAAAGGCAGTCATGCCTCCTTTGACAGGC 
TATGTCGGAAAACAAATAGGTGAAACTATTTTCCCTGGTAGTGCAGATCTTGTAGCACCC 
GCTGCGTCTTTAGTT GCAGCACAATTGGTTGATT CAAGGTTTAATAACAGAAGACAAAGA 
TT GAAAGAC G CAGCCAGAAAGC GT C AC C G CT AT GTT AGAGAGAT GCAT AATATT T CT GAT 
AAAGAGT CAAAT G CT T CT AAT GATACG GT AATAT CAC CTTTGATTGGACATGGT T C GC GC 
ACTGAAAATCGTTTTGAATATTTGAGACCTAAAGGTGGAAATTATTTATACTAATAAftAA 
T CATAACAGACC T GAC GGGCGGT CATCCTTTTTTATTAGATGCAGAAAT TTGTAC CT C CA 
CCACGAATCCTTGCTCCAACAGAGGGTAGAAACAGTATTACTTATACGCCTCT GGCACCA 
CT G CAAGAT ACAACAAAAGT ATT CT T T ATTGACAATAAGT CTT C GGACAT T GAAAGTTTA 
AACTTTACTAATAAT CACAGTAACTTTTT TACAAAT ATTATT CAAAAT GCTGATTTGGCA 
G CG GAT GAAGCAG CAACGCAAGATAT TAAACTGGAT GAAAGATCTAGAT GGGGCGGTGAA 
CTGAAAACTTTTATAAAAACAAATTGCCCCAATGTTTCAGAATTTTTTAAGAGTAATAGC 
TTT CTAGCCAGATTAATGGTAGATAAAACTGATC CA6AACATCCTAAATACGAATGGGTA 
CAAATTACAATTCCTGAAGG CAATT ACACT GGAAGCGAACTTATAGATCAACTT AACAAT 
GGTATTTTAAACAATTACTTAGAAGTGG^AC^CAAAAAGGAGTAGAAATTGAAGACATA 
GGAGTAAAATTTGATACAAGAGATTTTT^ 

ACT CCAGGAAAATATACATATAAAGCTTTTCAXCCAGATATTATCTTGCTACCTGAAT GT 
GGCGTAGATTTTACATATTCTAGAATTAATAATATGOTAGGTATAAGAAAGAGATTTCCA 
TATACTAAAGGATTTCAAATTTTATACAOTGATTTGACGAAGGGAAATATCTCTCCATTA 
CTGAATTTAAATAACTATCCTCATTCTATCGAACCTGTAATGCJ\AGACGAAAATGGAGTT 
AGCTATAAT GTAGAAAAAATAAGT GACAATC CCC C CAGAT GGCAAACAAAGTACAGAT CT 
TGGACTTTAAGTTATAAAAATAATGGAGGAGCTAAAGCCCTAACTGTACTAACT GT TCCG 
GACATAACAGGAGGATTAGGTCAAATTTATTGGTCAATGCCAGA'TACTTTTAAAGCACCT 
ATTACTTTTACTAACAATACTACAAAGCCAGAAACACTTCCAATTGTTGGATTACATATG 
TTTCCTTTAAAAGCAGGGTTAjGTTCATAATATAAATGCGaTTTATTCTCAACTTTTGGAA 
CAAATTACWKATACAACTC^CTATTCAATAGATTTCCTAAAAATGCTATACTAATGCAA 
C CACCTTACAGCACCGTAACATGGATAAGTGAAAAT G£ CCCCTTTGTT GCAGATCACGGG 
ATTCAG C CAT TAAAAAACAGCCTTAC AGGT GT ACAAAOAGTT ACT ATAACAGACGACAGA 
AGGAGATCTTGTCCATACATACAGAAATCTTTG<^GACTOTTGTCCCTAAAGTACTTTCA 
AGTGCTACACTTCAGTAACAATCTGGCTC^TATCTCTGGGCCTTATCCTCCTG'SAACCGT 
TATGTCTATTTTAGTTAGTCCCT CT GATAATACCGGGTGGGGTATTGGAACAT CAAGTAT 
GAGGGCTACTGGCTTGAAATTTTCTAAAAAACAACCTGTTAGAGTGCGACCTTATTACAG 
AGCTCACTG^GGACAGCTTAATGCTCGTACTTC^CT 

ATATTATGAAAAATTGTACAGGGACAGACTAAAAAGAAAAACAGTTGTTCCAAAGAAAAA 

GAGGT CACCT ACAT CTCCTGCGGAT CGACTTAAAAAATATCTTAAAGCT GTCAGTCAAAT 

CAAAGCTTT CAATAGAGCTAGAAGAGCAGCCCAATAAATATTATTTTT CACTTGCAGAT G 

AAGGTAGTTCACGTGCTTAAATCTCCTCATCGTCGAAGACATACACCTCGTTACAAAAAA 

CTAAAAAAAATCAATCTATCTCCATACATTTTACCTAAAGAATTGCAAGGCGGTTTTTTA 

C CAGCTCTCATTCCTATCATAGCAGCCGCAATTAGC GCAGCCCCTGCTATAGCTGGAAGT 

GTAATAGCT GCTAAAAAXGCTAATCGTTCTTAAAATTTAGAAAACTTTTTTTT TAACAGA 

TCACATGGCTTTTTCAAGATTAGCTCCCCATTGCGGCTTAACACCTGTTTATGGCCACAC 

CGTTGGAATCTGTGATATGAGAGGAGGTTTCAGCTGGTCTAGTTTGGGAAATTCTTTTAC 

TTCTGGTTTAAGAAACATAGCTTCATTTATATCAAATACTGCTCAAAAAATAGGTCAATC 

ACAAGGATTTCAGCAAGXTCAAACAAGGTCTACTGCAATCAAATGTTTTAG^ 

ACAATTAGCAGGTCAAACTTTAAATACT TT GGTAGATATTGGAAGAT TAAAGGTAGAGAA 

AGATCTAGAAAAATTGAAACAAAAAGTTATAGGGAACGACCAACAAATTACTCAAGAACA 

ATTAG CT CAACTAAT AGCCAGCTTAAAACCAAAAGAT GAAAT GTTTGTAAAGCAAT C AGA 

AAAAATTGTTGAACCTATGAGACCAGAAATTAAATCTAGCCAAATGCCTGTAGAAATGTC 

TTTTTATGATTCTGTAAGT GATGAACCAAT CATAAAAACCAAAGAAGTTAGCCCTCCTTC 

ATTTTCATCTGAATCTTCACATTCATATTC7TCACCCAAGAAAAAGAAAACGCGTATCCGG 

TTGGGGTGC^TTTTTGGATAAJCATGACTGGJ^TGGAGTAAATTTTAATACAAGAAGATA 

T T GTTATTAAAAACACTTTT TATTTACAGAT G GAGCCACAGC GT GAATTTTTT CACAT TG 

CGGGTAGAAATGCAAGGGAATACTTGTCTGAAAATCTGGTACAATTCATCTCTGCCACTC 

AAAGTTTTTTTAATCTrTGGAGAAAAKTTTAGAGATCCTTTTGTAGCTCCATCGACGGGTG 
TAACTACTGACCGTTCTCAGAJ^CTTC^CTTCGrATAGTTCCGATTCAAACTGAGGACA 
AT GAAAACT T T T ACAAAAC T AGATT TACT T T AAAT GTAG GAGAT AACAGAGTT G CAGAT C 
TTGGAAGTGCATATTTTGACATTGAAGGAGTTATTGATA£AGGACCTACTTTTAAACCTT 
ATGGAGGGAGAGCTTATAATCCATTAGCCCCAAAATCAGCTTTTCCCAATGCAGCTTTTA 
T GGATACTGATGAAGC TACAACAATT TATATTGCTCAAC TCCCTAATGCTTATAATGCT C 
AAAACAAAGGTGTAGAAGAAGCAATTCGAGTAGAAGCAAACACTACTACTCCTAATCCTC 
AATCAGGAGAATATGCTACTTATGACTCTGCCAAATTTAATCCAGAAACTACTGGTGCTT 
C7GGAAGGCTTTTAGGAATTAATAGCTTAGGAGATCTTTTTCCGGCTTATGGATCTTATT 
GTAGACCTCAATCAGCAGATGGTAACATTTCAACTGCACCCATAACTAAAGTCTATCTAA 
ACACTACT GCTACAGAT GACAGGGTCAGT GGAGTTACTGCAGTT GACACCGCAACCAGAT 
TGCATCCAGAT G CT C ATTATATT GAAT ATACT GAT GAAGCCAAAG CTACAGCTATAGGAA 
ATCGC CCAAATTATATTGGTTT CCGAGAC^ATTTTATTGGACTCATGTTCTACAATAATG 
GTTCTAATGCAGGAACATTTTCCAGC CAAACACAACAACTTAATGTT GTTTTACACTT GA 
ATGACAGAAACAGTGAACTAAGCTATCAATATCTAATAGCAGATCTGAC^GATAGGTATA 
GATATTTTGC AC TT T GGAAC CAAG CAGTT GAXAGT 7 ACGACCAGTAT GT CAGAATT T T GC 
ATAATGAAGGATATGAAGAAGCCCCTCCGGCCTTATCATTTCCTTCTCAAGGTATCCAA 
AATTATTTCATGCCTACTGCGGCAGGTAATGCGATGACAGTAGACACGGGTAGAAATACT 
GCAGCAAAAACAGArAACACCAAGGCTTTTATAGGATATGGCAACATGCCATCrTTGGAA 
AT GAATCTGACAGCAAATCTACAACGTACATTTTT GT GGTCTAAT GTAGCAATGTAT CTG 
CCAGATAGGCTGAAAACAACACCACCCAACATAAATCTACCTGATGACACCAACTCTTAC 
GGATAT ATAAAT GGAAGGGT C C CT CT AGCAAACATAATAGATACAT GGACTAACATT GGG 

gctaggtggt cattagat gttatggatactgtaaatccatttaatcaccacagaaattca 
g gactaaagtataggt cacaact gttaggaaat ggaagatattgcagat ttcac at t caa 
gtac ct caaaaattttt t cc tataaaaaat cttt t gttgct gccaggaacatataatt at 
gaat ggtacttt agaaa 

*ggatccgaacatggtttttcagtctacttta<^taacgaccttagagcagatggcgcaac 
tattacatacaccaacataaatttatatgtttgatttttccctatgaattatgaaacagt 
aagtgaacttgaattgatgttgcgtaatgctactaatgatcaaaactttgcagattattt 
gggtccggtaactaatctttat caaatcccagct aatacaaatact gtagtagtgaacgt 
ac cagat agat ctt ggg gt gcttt cagag gat ggagttt caat agaattaaagctt caga 
aacacctatgataggagcaacaaaagatccaaattttacttattcaggatctata 
actagat ggtactttctatttaacacacact tttcaacgagtttctattcagtgggattc 

TAGCGTTCCATGGCCAGGAGATGATAGGCTTTTGATTCCAAATTGGTTTGAAATTAAGAG 
AGATC CTAAT AT GGAC GCAGAAGGTTATAC T ATGAGT CAAAG T ACT ATCACAAAAGATTT 
TTATTTGGTACAAATGGCTGCTAATTATAATCAAGCTTATCAAGGTTATAAATTGCCAGT 
ACATTCTAAATATTATGGATTTTTAGAAAATTTTCAACC TAT GAGTCGC CAAG TAC GAAT 
TTATGGTAATGGCACTTATGATTTATAIACTGCTTATATTACAAACCAAAGAACCATGCA 
AATTTGGAATAATAGTGGTTTAGAATCTAAAACTTCAAATCCTCCTATGTTATCCAACAC 
T GGT CATCTTTAT G TAGCTAACTGGCCATACC CTT T GATTGGAC caaat GCTATT GAAAA 
C CAACAAAC TGAAAGGAAATTTTTGT GTGATAAGTATAT GTGGCAGATAC CAT TTTCTAG 
TAATTTTTTGAATATCGCTAATTTAACAGATTTAGGGCAAAGTGTTTTGTACACTAATTC 
TAflTCATTCACTTAATATGGTTTTTACTGTGGATAGTATGCCTGAAACAACTTATCTAAT 
GCTTT T ATTT GGT GTTTTC GAC CAAGTT GTTATTAATCAACCAACAAGAAGTGGAATAAG 
TGTAGCTTATTTGCGCCTTCCTTTTTCAGCTGGTAGTGCAGGAACATGAGCGGCACATCC 
GAAAGTGAGCTGAAAAATCTGATTT CATCATTACATTTAAATAATCGATTTTTGGGCATT 
TTTGATTGCACfcAXTTCCATCTTTTCXGCAAAAATCTAAAATTCAAACl t GCTATTATTAA.T 
ACAGGT CCCAGAGAACAAGGC GGAATACACTGGATAACATTAGCAT TAGAACC CATTT CT 
T ATAAGCTATTT ATATTT GAT CCACTC G GAT G GAAAG ACACT CAATT AATTAAAT TTTAT 
AATTTTTCACTAAATTCTCTTATTAAAAGGTCGGCCTTAAATAACTCAGACAGATGTATT 
ACAGTAGAAAGAAATACTCAAAGTGTTCAATGTACCTGTGC GGGATCGTGCGG CTTGTTT 
TGTATATTTTT CTTATACTGTTTT CACTTTTATAAACAAAAT GTATTTAAAAGTTGGCTT 
TTTCAAAAATTAAACGGTTCAACCCCTTCTCTGATCCCATGTGAACCACATCTATTACAT 
GAAAACCAGACATTTCTTTATGATTTTTTAAATGCAAAAAGTGTTTATTTTCGiAAAAAAT 
T AT AGAA CAT TT ATT GAAAAT ACT AAGAC T GGATT AATAAAAACACAT TAAT T GT ATT CT 
TGCTTTTTGACGTTTTCATTAGTCTTCATCTTCATCTTCTTCTTCACTGCTAGiATTCCAA 
GATGGTTT TTTTTTTCTTT GATGGAGTAGGCTCTTCAATAGTTCCAAAAGGATTC ATATC 
AGAATCCTCTTCTATGTTAGGCAACATAGTATTTTTAACCTGGAATGACTGATTCCACTT 
AAAT TGAGAAAACTGAATTGGAATGTT ATTT CCCATACATTCATTCCAAAATTT AC GCAC 
AAGAGTTAAACACTGTAACATATCTGGCAAGCTAATTTTCATCT CACAAAATTTTCCATT 
ATT AC GT CTCAAGTT GT AT T GATAGTTACAACATTGAAACACAAAAACAGCAGGGAAT OT 
AACTGCTGCGGCCTGAACTCTATTAACATC CTGAACATCAATTC CTTCCACTCCAGATAT 
AGAAAATGGAGTTATTTTAGGGAGTTGTTTTCCTATTGTTTGTTTGCCACCATAATTACA 
TTCACACTGACCCAATATAAAAAGCATATTTCCGACTTTAGCTTTCGGAAACACAGCTTT 
TCTAGTTTCAATGGCATTTTGCATAGCCAGCAAGGCCTTCTTTTCATCTGAAAAGTTAAG 
AC CACAACT G C G AGGAG AACAT T GC C CAAAAC GC T G AT GG GCAT C CT CAGCACAT AACAC 
GTAATGTTCCTOAACTATTTTTACTACTTGTTTATrCATACGCCCATTACTAAGAACACC 
CCTCCCTTCCTTTAGGGCTTGCACCCCTGCTTCCGATGTTGGAOGCATTTCAATTTCATT 
CACCCTTTTAAACATGAAGTCACCATGAAAACATCTAGGACGGTCCTCCTCCCAATCArG 
ATACCACAAATAACAACCAGAAGC^TTAAAGTTTGGAATCAAGTCAATTTGCTTACAAAT 
T GCACTATATAGCATT CT ACCTCCTACAGT AGCCATAGATTTACTGCT ACTATAAGTCAA 
ATTTATAATTTTC^TCTTTTTC^TGTACTC^GCAAATAATTTTTCACAATCTCCTTCTTC 
AGGAT GAAACTTCATT TGACTGGTATCAACTTTAACACACTCTCCAAATTTAG C TAAAAT 
TTCGAGCGCCGCTTGAACTTTATTCTGAAATTCTTCTGTAGTAGATTTTCTCTTCTTGAT 
AGATTTAGTAACTTTTTTAGAAGACATTATGTTAGTTTtTTTCTCGTTGTAGGATGGCTG 
AAAAAAATATGGGAGAGTCAGAGAAGG GTTTGAACGAAGAAGAATTTAACTCTATTCTAT 
CAAAACATCT GGAAAGAGAAAT TAAAAT CT GTAAAGCGTTAACAT CAAAATTATC GAACT 
GGAATATTGGAACATTGTTAGAAAACT TGTTATTTT GTCCTGATGAAAGACAATCATCAG 
GTGATCCCGACCCAAAACTAAACTTTTATCCGCCTTTTTTAATTCCGGAATGTCTTGCAff 
T G CACTATCCATTTTTTCTAACAACTC CTATTCCGCTATCATGCAAAGCGAAC AAAATAG 
GAACTAACACTTACCGAAAATGGATGAACAATCAAGTCCTGGATTTACAAATACCTTCCT 
T GGAAAATT GCAAATGGGAT GATAGCT TGGGAAATGT AGATTTAAT T GAAGAGCT TAAAG 
AGAACCAAAAACTTGTTTTAGTAAAACAAGACCATGAAAGAAATATATGGTTTAAATCAA 
AATGCAAACAACTTCAAAGTTTCAGCTATCCCTCACTCAGTCTGCCCCCAGTTTTACAAC 
AAGTTTTAATTGAATCTCTTAT CGGCATTAGTCAGGAT CCTAATAACTTTGACAAAAATT 
ACGAACCTGCAATAACTCTAGAAAAACTACAACATGTAAACTGTGATCAAGATTrAAAAC 
AAGTT CAACAAAAAGTATCTT CAG C C GCTACATACGGAATACTTT T GAAATGCAT T CAGA 
CTTTATT CAGT GACAAATTATT CAT T CAAAACT GC CAGGAAT CATTACATTACACCTTTA 
ACCAtGCTTATGTAAAATTACTTCAATTTTTGAGAAATGTCACTTTAAGCGAATTTGTAA 
CTTTCCATGGTTTAACAQVCAGGAACAGACTCAATAATCCGCAGCAACATACACAATTGG 
CAACCGAAGACAAAATAGACTATATCATAGATACAGTGTATTTATTTTTGGTA'TTTACGT 
G GCAGACAGCAATGGATATTT GGAAT CAAACAT T AGATGATAAAACAAT AAATATAATT A 
AAGAGGAATT AAAC CAAAATTTT GAGAAAATTGTCAAAGCTGAATCAGTT GATGAAGTTT 
CTGAAATTTTAAAGTCTATTATTTTCCCTC3AACTCATGCTGCC3AGCTTTTTGTTCTAATT 
TACCTGATTTTATAAATCAGAGTCAGATATCAAATTTTAGAAACTTTATCTGCATTAAAT 
CCGGCATACCGCAGTCAATTTGCCCCCTATTACCTTCAGAXCTAATTCCTTTAACTTTCC 
TAGAAAGTCATCCAATACTCT GGAGTGATGTAATGTTACTAAATCTT GCTTCATTTCTAG 

taaaccaaggcaattatttgcatgaacccgaaaaacctttaaatatttcatcagtttact 

gtaawgtaatttatgctctccgcaaagaatgccatgttacaatagcagtttgatgcaag 

aaatactaaccattgataaattcgagttcacaaactctgataaaacaaaacagctaaaac 

tgaccctccaaacttttgctaatgcctatcttaacaaatttaactcagcagaattctacc 

atgaccaagttttattctacaaaaactgtaaaagtaaattttctaaccaatta^cag ctt 

gtgtaataaaagacgaaaaattattggctaaaatagcagaaattcaaataacgcgggaaa 

aagaactcttaaaaagaggaaaaggaatttatttggatccagaaacaggagaaatcttaa 

acaatggagaagccatatcatcctctgaaaacttccaaaggcaaagaactagctatgctc 

tacc^tcaaatgaaggagagcgagctggatggoaagccgatgagcgaagaagacgaagga 

gaagtgagtgaggatgaaacagagacaacaattccaaagaaaal'gaagtttacaagtaag 

taagctctaaattttttatattaaaaact gaatttttttagacaaaattattt taaatta 

aatctttatagctagcagttgatctttgttcgtttttcagaaaactcaagtgttcagtc 

at at caagttcacttgcctctgaaacac gaaattgcggaaatt CTAGAAAAAATTAGACT 

agaat CTAAAAAATAT CC AGGAAAAGTTTATCAAATAAGAAATAGAACT ccagc aagtat 

tacaaaacgatacctgtatgaaagagatctgaagaaactgttccagtatctagaagacgc 

aaagaagctttacgctaagtaccaaagctgaggctttatagttttaaattttcccgccat 

ggctcaaccagtgac gccttacgtct ggaaataccaaccagaaacaggatatactgctgg 

agcccatcaaaattataacactgttato^ctggttgcatgccaatccacaaatgtttgc 

cagaattcaacatataaacaccgcacgcaatgttatggacaaattccgctctgatttgac 

ccgagatgacatcgcggttaacatcaacaactggcctgcagaggatttaatgcaac 

taattttccttacattcctgcgacctctaaatccgcttcaaccataaatgactggttggc 

t accact caaggaattcaactcagtg gaactagtgaactaaacgggtggggatctaaccg 

cctgacttcctatccggatattccacccattttaaagtatgaaag<5cctggtcaacaact 

tcaagoccaaggactttttaagcaagaaaatattcatttattttacgaatctccgcgcct 

ccctcgctctggag^ttaactccccaacaatttgtaaaagaatttccgcctgttgttta 

taataaccccttctcagaatctatgagtgtatttccgaaagaatttagtcctttgtttaa 

ccctt cagaatctttgaaaaaaacat ccagtcaaactttacaatataaataaaaaacttc 

tattgatctttatacttacactaaagcatcgcgtttattttcgtcgccataaaaatatat 

caaagacccgtaattctctaactttaaatcattttttgaactaatcttaatccatttaaa 

tgtaggaattaatatatcagaaaccagtaacaagccagaattaaaaxatacttgtgtcat 






TTT T ACAGAT GAAGC GAGCAC GCT G GGAC C C C GT T TAT C C CTT TT CT GAAGAG A'SACT GG 
TTCCTCTGCCTCCTTTTATT<3AAGCCG<3AAAAGGGCTAAAAAGCGAAGGGTTGATCTTAT 
CTTTAAACTTTACTGATCCTATCACTATAAATCAAACCGGTTTCTTAACTSTAAAATTGG 
GAGATGGAATATTCATAAACGGAGAGGGTGGCCTATCAAG CACTGCTCCAAAAGTCAAAG 
TTCC C CTGACTGTCT CAGATGAAACATT GCAACTGCTATTAAGTAATTCTCTAACAACTG 
AGTCAGACTCTTTAGCTTTAAAACAACCGCAACTTCCCCTAAAAATAAATGATtjAGGGGA 
GTTTAGXATTGAACTTAAATACTCCTTTAAATCTACAAAATGAGAQATTGAGTTTAAATG 
TTTCAAATCCACTAAAGATAGCGGCAGATTCTTTAACTATAAACTTAAAGGAACCCCTAG 
GATTGCAAAATGAAAGTTTGGGCTTAAATCTAAGTGATCCTATGAATATAACTCCAGAAG 

gaaatttaggtattaaattgaaaaatcctatgaaagttgaagaaagttctttagccttaa 

ACTATAAGAATCCTCTCGCCATTAGTAATGATGCGTTAAGTATAAACATTGCGAATCCAT 
TAACTGTTAATACAAGCGGATCTCTAGGAATATCTTATTCXACTCCCTTACGAATTTCAA 
ATAATGCTTTATCATTATTTATAtSGAAAACCTTTAGOATTAGGAACTGACGGCTCTTTAA 
CTGTAAATTTAACTAjG<5CCTCTGGTATGTCGTCAGAACACTTTGGCCATAAACTACTGAG 

c ccc^ctagtgtc^t tgcaagacaat cttacrtttaagttatgctcaaccatt aactgtaa 
gcgataattctttaagattgtctctaaattctccactaaacacaaatagtgatggaaaac 
ttagtgtaaactatt ctaatcctttagtt gtgactgactctaatcttaccctcagtgtta 
aaaaacctgtaatgat7aacaacacaggtaatgttgacttaagctttacagctcccataa 
aat t aaat gat g cag aacagt t gact ttagaaac cact gag c c ctt g gaagt ggc c gat a 
acgctctaaaactgaaacttggaaaaggcttaactgttagtaataatgctttaaccttaa 

ACCTTGGAAACGGTTTGACTTTCCAACAAGGTCTTTTACAAATTAAAACTAATAGCTCTC 
TAGGGTTTAATGCTTCTGGGGAATTATCAACAGCTACAAAGCAGGGAACCATAACCGTTA 
ACTTT CTAAGCACAACTC CTATACCTTTT GG GT GGCAAATAATAC CT ACTACTGTAGCT T 
T CATTTATATTT TATC AGGAACACAATTTACTCCT CAATCCCCAGTAACTTCTTTAGGTT 
TTCAACCCCCACAAGACTTTTTGGATTTCTTCGTTTTAAGTCCGTTTGTTACATCTGTAA 
CTCA^TTGTGG<jAAAT<^TGTTAAGGTTATTGGCCTAACTATTTCTAAAAACCAATCT 
CCATAACTATGAAATTTACTTCTCCCTTAGCTGAAAATGTAGCAGTTAGTATGTTTACAG 
CACAT CAATTCAGACAATGAATATTTTAAAAATTCTT TATTAAAGAGTAATCTTTTTACA 
T ACCGTTCTT GACATAAT GTG CCT CT ATAATTAACAAATCTAAGCAAGCAAGGTTGATCA 
TTGGAATCTATAGAAGCATAACTCTTCCAATAAGCATAATCATATGGCGGTAAATGAAAA 
C CC CTT AAAT CT AC CAT AT TCAT CTTTAAGT GT ACAGTAT CTAACAGGTTTTTACAAT CT 
TGCACTTCT GGACTTTTAAAAACAAACAGTACTTTCATAGGACAACAATT GTAAC G GTTA 
TAATCTGTTACAAXTTTACTTATTTCTTCTTCCAATGGCAAAGOVTTCCAAAGTCTTGTT 
ATAAGTACTGTAAAATCATCAAATGAATAACATAACACATTTGTACAACAATTGGTCCAA 
GGTAAAAAAACAGGCACACGAACAT GAACT T TTTTTAAAATT AACAT CAGT GT CT GTT TT 
AAACTTT (JACATTGCAAAGAATT T GGCTGCAAGCAAT GACAAT GAAAT T GATTTT GCT GA 
CAAfiGTAAGTCACACAAATACAACTTTAACAGCCTAAATATAACAACATTAATGTAACTT 
TCCAAGACTTTAAAACTAACAAAC G GTATAT CACAAT AAAAAAGATGAT GAATCCCTTCG 
CAACACAT AAT GGAGTT CATGCTACATCCAAAGATGGTTC C GACAAAC CTCT GTAAATTA 
AAGAACAACAATACAACATACGAAGAAAATTAAAACGTTTTTCAAAACGAGATATACATT 
GCTGCAAAGTATCTGAACATTTACATTTTATACTTATAAGCTCACAAGTTTCAGAAAATG 
TAATTCGTTTAACAGTTTGATAT GAATACCATTTTGAAGAAAAAT 
C^TCTTCOlTCACTCCAaAAAATAAAAAAT 
agaaaQagttttgtg 

catttgtQaagctcccagaaacattaacggacaXgcaaatccaagtattacaacaaacagg 

aacagtcttaacgtttcgttcagaaaacaaactaacaggcatatgattaaagcaagacaa 

taaaacacttttggcagctaaacattgcaaagatccaggtgaattacaatgacaatgata 

ataaaacttataagccatatcggccctcttgcaaaacgaatcagctttttggcttatagg 

aaaataacaaaaaaactgattatatatgaatggagttaatatcttcttcaaattatacac 

acgaatagcagaaccaagacgaccacgcccaacacaggtaaatat ttcaagtccatgact 

aggaacagatggtttctcacaagavac^ctttgaxttgcttatccatcact 

ggcttaataggaaaagaagaaaaataattttcccaataataacgaaagaaattccacgtt 

tcatcctgtacattactagtcacaaatacaacctccsctatcaaagatxccctatcattt 

aaaactcc»ccaaattgtcccagtctacctcaaaaaagccagttccc^tattttcaaaa 

tttgcccattttaaataatccaaagcatcaaattcaggaaacaaatctttctgagctaaa 

acatatacaottttatcgcc^ttaaatctaaaagccatcctaaatggacctctagcccag 

tagtttaagtaccgggaagagactatacaatatacttgatattgatgtctottaagtggt 

gataaaaaagaaagtaattcagaattaggataaagcattctcccatgttgattcatctac 

aaaaaacaaaaaaattataaggttcatagaaaacctactatttaacaaatctataaaaat 

g c attaaaaagtt ac ct t gaatataaattcagat cacctaaaaaacgaaaaaaaataaca 

tttatgttagtaaatgatagtctttaaaaattagaaaagaatcaagtcgctttta!tactt 

acaaactccaaataaat tctgtaaccaagagaaaaattgtaacctaaaaggtaaagaaga 
ACAT T AT AAGATT AAAAC CACT CT AAAA? CT G AAAA 3 CAT TAT GAAAAAT T CT GAT AGCT 
GCAACTTACTAGTCTTCTCCAAATGTTGCAGGCATTTCAAAAAATCAAGAGGAAAACCGG 
AGTTTATAAAGTAGTAGTCTGATTATATCTGAAAAAGTTTAACTTCCTTTTCAACCCAAC 
CCAGTCCAATAAAATTCCAACCTTAACTTCTTTCCTGCTAAAACTCCATAAAAGTCCAAT 
TACCACTTGACTTTTATTTAACCTCAATTATGTTACATGTTATTCTACCCATAAAAACTT 
GATGACCAAGAACTGACCTTTCCCATGTTTTTCTGAAATAACAAAAATGTTGATTTAAAG 
ATTTTTAACTACCCAAAAAACCCGCTCTCATGATTTTTTCTTATATAAACAGGATACAAA 
AGAACTGGCAAAGATATTCCATCATACTTCTCCAACTGTCAAAACATACCACTTAACCTC 
TCCCATGTTTTTTCCCTTTTGCACAAACAGGATATAAAAAATATTTTTGCCACAATGTTT 
TTCCTTTTACTCAACTGCCAGAATAAAAATGAACAGCTTAACCTTTTTCCCTCTTAACCC 
ATTGCGTTCCTCTAAGAAAAAAATTATCCCGCCCAATATGCTAAAGGCTTCTCCCGCCAA 
AACAGCT CAACTT AAAATCT CT CAT GAATAAAAC C CAGAGAAAATTT CCAGT AATAAAAA 
TTAATAACCGTGAAGT ACTAGAT CTAATAAT GATATTTT GAACT CATAAAAAT CCAC CAT 
CCATGTAAT GT TAC AAACACTTTTTT ATT GAGTTTTTTCTTACAACT GCATTACAT ACAG 
GCCAAGCATCAAACTTTCTTCTGTATTTCTTCCTAGACCACAAAATTACAGACTTATATT 
T CTGCCACAAATCT C TATGATCTTTAC AGTAACACTTACATTTAAATGGGGAATACAGCA 
G CAAATAAGGAT GAGTTAAACAT GC GATACAAT GAC CAGAAGGAASATAATACAATACAT 
CACACCAAAAT GAAG GTACAGAGAACAT C GCAT GAAAT CTTAAAT GTGATTTTAC AATAA 
ATTTCTGCAGCAGCTTACAATCTATATTAGCAAACCGTTTTATATACAAACATAAAAACT 
TGGAACTTTTCACCAACTCAATCATGTTATTATAACACATTACAAATTTTGCTATATCTT 
TATTTGTCyVAATAACAAAATATCTCAATCCACAGMCATCTGOCAGCAAACTTCGCAAAT 
C CATGACCT GTAAAAGATACAACAGAAAACAGAAAATTAATGC CATT CAATAAC ATAAAA 
AATACAGTC AAATCACAT ACTTTTT C T CACTTACAAAACT TTGTGAGCAGGCCT CCAAAA 
C AAACTT CAGAAAAT GGAT G CAT ACAAGAACATTCTCCT CTCAAAAATT GCTT T AACTGA 
ATGCGGCATTT T GC ACCT C CAGAAAAAT GCAGT C CAT T GAGAGG CT CT T CTCTTAAAACA 
CAGAAATGCTT CT G CAAAATCTGTAAAGAAACTAACAACT T CCAAATT C CAATCATCATG 
C ATTG CAAAGAAGGACAT TCAACAGCAAAAGGAT CGT GAT GAGCCAATAAAGCTTTACT G 
TATWCTGATTTTCA^GAATTACAGTCTGTAACTTACTATAArGCATTTTAAGCTCTGCT 
T CACAAATTAATAATGCTAATTT CTT TAAGCAGCT CAAAGAAAACTCAT CAGGAC AAC GG 
CATTT AAGAAAG CAACAAAAT GATTT CTTAAAATACATTTTTC CAGCATGAT GAACAAT A 
AAAAATTT CAAC GTTAAACAAT GCAAAAAT G CATTT TTAT GCACAGTGAAAGTAAT T TT T 
TCAGCTGAAGCTAAATCACAGCCTATTTTATTACATGATTTTGTATGCTCCAAAAGAGCT 
TGTTTTAATTGCTTCAAATCCATCTTCTTACAATTTTTTCTTTTTATAAACACCAGAACC 
GCATTCAGGCCAATTCC^GTTATTGTTTAAATTTGCTACAGAAACTGCAGACCACAAAAC 
CACATCCTCTAAAT CAACCCACAAAGATCTATGAT CCACACAAAAACACAAAGAATGATA 
CGGAGAATACAACAATAAAT GGG GAT T AACAAGGGAC GCAACACAAT GACCC GAAGGT AA 
TAAAGTTTTAC AGCAC CAATTACAAGCAACAG GTAAT GGAGTATATT T CCCAAT GCGACG 
AGAAAGCCGAATGT CATT CAGAACAGCATTGCATTT TAT CTTCTCAAACCTCTrAAGGTG 
CAATT GTAT AAAATAAGAAT CCTTAAT GACAGTGAT GAATT GAGGAAAAGCAAAAACAAA 
ACTAGCAAT GT CTTTGCTT GTAAGTTTCAAAAATATCTTCATCCAAAT CTCAGTCGGTAA 

ttcj^caaaaaattcaggCgcctacaaaattaatcagactaatttaatatca^ 

acagcgaaaagaaaaaataacac^cccaaaaataaaaaactcttacccctgttatccat 

gagatacac^gaaaaattcagaacactcagtgt catgtttcttaaatt gttcccaaagct 

cagagattctaagccaaaaattttttgagaactgcaaaaacccagtttttataacaaagc 

cttaatgttttcttaactgatttaactgccctaacaggaactccacattccggccaccgc 

c^cccagqjgacaaatcttgccaagaactacaagtccataa^ 

taccaaaggtttctatggtcgac^caattacaacctgacctaaaaggtgaataaagcagt 

aaataaggatgagttaaac^ggccacac^tgtccagaatgtaaaaaatgcittgtttgg 

CACCAACGAjGACC^CAGCTGAAGGAJVAGGAAAATTGTA^ 

CTGTTTAACACAGAACVACATTCAATTCTGGCAAACCrrCTTTAAAAAATGTTTTCTGAAA 
tatttctttaaaatgacagt t TGCAACT CT g GAAAACACAAAATAAAAGCCGCAATAT CT 
CTACTGCTTAAATATAAAAATATCATTGTCCAAATTTCTACTGGTAAAACTGAAAGCATC 

ttcttcctattaaaaaaagaaaagtgttttcaaattata^ 
tcaaatacttttcctttataatgtacattaagaataaa 

agaacttaacagtataatataaatacaagtgagctgaacaacgacagccgatttcagccg 

gagcaaaattaaaaagaataaaaggatcaaaccaacacgtagga 

agtaacggcagtatgacacagaaggafiagg^actaagtccaggaaacttcgcccggtgcg 

ataaaaagtaacgccgccggaaagcagttgaatacaaaagaggtaaaaattcacgaaaaa 

cagaagcaaaaactactaaatct gctattggcaaataaajgaaaaatttcaaaccatattt 

ccaaaggaagaaaago^tcataccgtagaa(^cctgaaggcgaccgcaaacgtgctcc 

cgtaccacaacgtcacacgccacacccactgggaaaacccacacgccccgcctctgtgca 

acgttatatatatgaatag 
end OAV287/start Bluescribe sequence 



GTACC CTTTGTTCCCTTTAGT GAGGGTTAA 

TTCCGAGCTTGGCGTAATCATGGTCATAGCTGTTTCCTGTCTGAAATTGTTATCeGCTCA 
CAATTCCACACAACATACGAGCCGGAAGCATAAAGTGTAAAGCCTGGGGTGCCTAATGAG 
TGAGCTAACTCACATTAATTGCGTTGCGCTCACTGCCCGCTTTCCAjGTCGGOAAACCTGT 
CGTGCCAGCTGCATTAATGAATCGGCCAACGCGCGGGGAGAjGGCGGTTTGCGTATTGGGC 

gctcttccgcttcctcgctcactgactcgctgcgctc(sgtcgttcggctgcggcgagcgg 

tatcagctcactc^aaggcggtaatacggttatccacagaat 

agaacatgtgagcaaaaggccagcaaaaggccaggaaccgtaaaaaggccgcgttgctgg 

CGTTTTTCCATAGGCTCCGCCCCCCTGACGAGCATCACAAAAATCGACGCTC^GTCA<3A 
GGTGGCGAAACCCGACAGGACTATAAAGATACCAGGCGTTTCCCCCTGGAAGCTCCCTCG 
TGCGCTCTCCTGTTCCGACCCTGCCGCTTACCGGATACCTGTCCGCCTTTCTCCCTTCGG 
GAAGCGTGGCGCTTTCTCATAGCTCACGCTGTAGGTATCTCAGTTCGGTGTAGGTCGTTC 
GCTCCAAGCTGGGCTGTGTGCACGAACCCCCCGTTCAGCCCGACCGCTGCGCCTTATCCG 
GTAACTATCGTCTTGAGTCCAACCCGGTAAGACACGACTTATCGCCACTGGCAGZAGCCA 
CTGGTAACAGGATTAGCAGAGCGAGGTATGTAGGCGGTGCTACAGAGTTCTTGAAGTGGT 
GGCCTAACTACGGCTACACTAGAAGGACAGTATTTGGTATCTGCGCTCTGCTGAA5CCAG 
TTACCTTCGGAAAAAGAGTTGGTAGCTCTTGATCCGGCAAACAAACCACCGCTGGTAGCG 
GTGGTTTTTTTGTTTGCAAGCAGCAGATTACGCGCAGAAAAAAAGGATCTCAAGiAAGATC 
CTTT GAT CTTTTCTACGGGGTCTGAC GCTCAGTGGAACGAAAACTCACGTTAAGGGATTT 
T G GT CAT GAGAT TATCAAAAAGGAT CTTCACCTAGAT CCTTTTAAATT AAAAAT GAAGTT 
TTAAATCAATCTAAAGTATATATGAGTAAACTTGGTCTGACAGTTACCAATGCTTAATCA 
GTGAGGCACCTATCTCAGCGATCTGTCTATTTCGTTCATCGATAGTTGCCTGACTCCCCS 
TCGTGTAGATAACTACGATACCGGAGGGCTTACCATCTGGCCCCAGTGCTGCAATGATAC 
C G C GAGAC CCAC GCT CACCGGCT C CAGATTTAT CAGCAATAAACCAGC CAGCCGGAAGGG 
CCGAGCGCAGAAGTGGTCCTGCAACTTTATCCGCCTCCATCCAGTCTATTAATTGTTGCC 
GGGAAGCTAGAGTAAGTAGTTCGCCAGTTAATAGTTTGCGCAACGTTGTTGCCATTGCTA 
CAGGCATCGTGGTGTCACGCTCGTCGTTTGGTATGGCTTCATTCAGCTCCGGTTCCCAAC 
<^TCAAGGCGAGTTACATGATCCCCCATGTTGTGCAAAAAAGCGGTTAGCTCCTTCGGTC 
CTCCGATCGTTGT(^GAAGTAAGTTGGCC<KAGTGTTATCACTCATGGTTATGGCAGCAC 
TGCATAATTCT CTTACTGT CATGCCATCCGTAAGATGCTTTTCTGT GACTGGTGAGTACT 
CAACCAAGTCATTCXGAGAATAGTGTATGCGGCGACCGAGTTGCTCTTGCCCGGCGTCAA 
TACGGGATAATACCGCGCCACATAGCAGAACTTTAAAAGTGCTCATCATTGGAAAACGTT 
CTTCGGGGCGAAAACTCTCAAGGATCTTACCGCTGTTGAGATCCAGTTCGATGTAACCCA 
CTCGTGCACCCAACTGATCTTCAGCATCTTTTACrTTCACCAGCGTTTCTGGGTGAGCAA 
AAACAGGAAGGCAAAAT GCC GCAAAAAAGGGAATAAGGGCGACACGGAAATGTTGAAT AC 
TCATACTCTTCCTTTTTCAATATTATTGAAGCATWATCAGGGTTATTGTCTCATGAGCG 
GATACATATTTGAATGTATTTAGAAAAATAAACAAATAGGGGTTCCGCGCACATTTCCCC 
GAAAAGTGCCACCTGACGTCTAAGAAACCATTATTAITCATGACATTAACCTATAAAAATA 
GGCGTATCACGAGGCCCTTTCGTCTCGCGCGTTTCGGTGATGACGGTGAAAACCTCTGAC 
ACATGCAGCTCCCGGAGACGGTCACAGCTTGTCTGTAAGCGGATGCCGGOAGCAGACAAG 
CCCGTCAGGGCGCGTCAGCGGGTGTTGGCGGGTGTCGGGGCTGGCTTAACTATGCGGCAT 
CAGAGCAGATTGTACTGAGAGTGCACCATATGCGGT GT GAAATACCGCACAGATGCGTAA 
GGAC3AAAATACCGCATCAGGAAATTGTAAACGTTAATATTTTGTTAAAATTCGCGTTAAA 

tttttgttaaatcagctcattttttaaccaataggccgaaatcggcaaaatcccttataa 
atcaaaagaatagaccgagatagggtt gagt gtt gt t ccagtttggaacaagagt c cact 
attaaagaacgtg<iactccaacgtcaaagggcgaaaaaccgtctatcagggcgatggccc 
actacgtgaaccatcaccctaatcaagttttttggggtcgaggtgccgtaaagcactaaa 
tcggaaccctaaagggagcccccqatttagagcttgacggggaaagccggcgaacgtggc 
gagaaaggaagggaagaaagcgaaaggagcgggcgctagggcgctggcaagtgtagcggt 
cacgctgcgcgtaaccaccacacccgccgcgcttaatgcgccgctacagggcgcgtcgcg 
ccattcgccattcaggctgcgcaactgttgggaagggcgatcggtgcgggcctcttcgct 
attaggccagctggcgaaagggggatgtgctgcaaggcc5attaagttgggtaacgccagg 
gt tttcccagtcacgacgtt gt aaaacgacggc cagt gaattgtaatacgact c actata 
GGGCGAATTCGAGCTC GGTAC end of Bluescribe sequences 
KpnI site with 5' base 



